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INSECT VIRUSES AND THEIR USES IN PROTECTING PLANTS 
FIELD OF THE INVENTION 

The present invention relates to insect viruses useful in control of insect attack on 
5 plants. It particularly relates to biological insecticides, especially those comprised of 
insect viruses. In particular applications, the invention also provides recombinant 
viruses and transgenic plants. 

BACKGROUND OF THE INVENTION 

10 There is increasing awareness of the desirability of insect pest control by biological 

agents. Considerable effort in recent years has been devoted to the identification and 
exploitation of DNA viruses with large genomes, especially the baculoviruses. These 
viruses generally require extensive genetic manipulation to become effective 
insecticides, and the first such modified viruses are only now being evaluated. 

15 

In contrast, very little effort has been devoted to the study and use of small viruses 
with RNA genomes. 

Four main groups of small RNA viruses have been isolated from insects. These 
20 include members of the Picornaviridae, the Nodaviridae, the Tetraviridae and the 
unclassified viruses. Descriptions of these groups can be found in the Atlas of 
Invertebrate Viruses (eds J.R. Adams and J. R. Bonami) (CRC Press, Boca Raton, 
1991) and Viruses of Invertebrates (ed. E. Kurstak) (Marcel Dekker, New York, 1991). 
These disclosures relating to these viruses concern their pathology and biology, not 
25 their use in biological control. 

Further information regarding small RNA viruses of insects an be found in P.D. Scotti 
et al (1981) "The biology and ecology of strains of an insect small RNA virus 
complex" Advances in Virus Research 26, 1 17-143. This review describes the insect 
30 picornaviruses cricket paralysis virus and Drosophila C virus (diameters estimated at 
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27-30 nm with one RNA component of 7.5 - 8.5 kb). N.F. Moore & T.W. Tinsley 
(1982) The small RNA viruses of insects. Brief review Archives of Virology 72, 229- 
245. This review included viruses of the following families: 

Nodaviridae (diameter 29-30 nm, 2 RNA components totalling 4.5 kb) 
5 Picornaviridae (diameter 27-30 nm, one RNA component of 7.5 - 8.5 kb) 

Nudaurelia p family (now called Tetraviridae) (diameter around 35 nm, either 

one RNA of 5.5 kb or two totalling 8 kb) 

N.F. Moore, B. Reavy & L A. King (1985) General characteristics, gene organisation 
10 and expression of small RNA viruses of insects. Journal ofgenerhl Virology 66, 647- 
659. This reference defines small RNA viruses of insects as being those less than 40 
nm in diameter. The review covers Picornaviridae, Nodaviridae and the Nudaurelia p 
family (now called Tetraviridae). 

15 D. Hendry, V. Hodgson, R Clark and J Newman (1985) Small RNA viruses co- 
infecting the pine emperor moth (Nudaurelia cytherea capensis). Journal of general 
Virology 66, 627-632 described viruses with mean diameters of 40nm and 38nm and 
one or two RNA components up to 5.5 kb in length. 

20 Most recently, the term insect small RNA viruses has been used by one of the present 
inventors to cover three main recognised toxic groups: the Picornaviridae, the 
Tetraviridae and the Nodaviridae (P.Scotti & P.Christian (1994) The promises and 
potential problems of using small RNA insecf viruses for insect control. Sains 
Malaysiana 23, 9-18). 
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These references illustrate a long standing usage of the term in this field of the term 
"small RNA virus" for viruses with certain characteristics as listed above. Another 
important characteristic of these virus groups is that they are not occluded, in contrast 
5 to many large viruses like the cytoplasmic polyhydrosis (RNA) viruses or the DNA 
baculoviruses, granulosis viruses and entomopox viruses. The term would also be 
applied to viruses not members of the three families listed above, as long as they 
satisfied the definition of being up to 40nm in size. There are reports of such 
unclassified viruses (eg in Hendry et al 1985). Moreover, the taxonomic status of 
10 some members of the Tetraviridae still requires clarification and it 'might even be 
possible for this family to be split, with HaSV and other members with two RNA 
components in their genome being separated from those with only one component, like 
the type member Nudaurelia p virus, which has not yet been sequenced. The above 
definition of "small RNA virus" would still cover all members of such virus families. 

15 

SUMMARY OF THE INVENTION 

In a first aspect of the present invention there is provided an isolated small RNA virus 
wherein the virus is up to 40nm in size, is not occluded and infects insect species 
20 including Heliothis species. 

In one particular embodiment, the present invention provides an isolated preparation of 
Heliothis armigera stunt virus referred to as "HaSV" herein. 

25 In a further aspect of the present invention there is provided an isolated nucleic acid 
molecule comprising a nucleic acid sequence hybridizable with RNA 1 (SEQ ID No: 
39) or RNA 2 (SEQ ID No: 47) described herein under low stringency conditions. 
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In still a further aspect the invention provides a vector comprising a nucleic acid 
molecule, the sequence of which is hybridizable with RNA 1 (SEQ ID No: 39) or RNA 
2 (SEQ ID No: 47) as described herein. These vectors include expression and transfer 
vectors for use in animals including insect, plant and bacterial cells. 

5 

In a further aspect the invention provides an isolated protein or polypeptide 
preparation of the proteins or polypeptides derivable from the isolated virus of the 
present invention. The invention also extends to antibodies specific for the protein and 
polypeptide preparations. 

10 

In a yet further aspect the invention provides a recombinant insect vims vector 
incorporating all or a part of the isolated virus of the present invention. 
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In a still further aspect of the present invention there is provided a method 
of controlling insect attack in a plant comprising genetically manipulating 
said plant so that it is capable of producing HaSV or mutants, derivatives or 
variants thereof or an insecticidally effective portion of HaSV, mutants, 
5 variants or derivatives thereof such that insects feeding on the plants are 

deleteriously effected. The present invention also provides a transgenic plant 
so manipulated. 

BRIEF DESCRIPTION OF FIGURES 

10 

Figure la is a restriction map of RNA 1 (SEQ ID No. 39) 
clones. 

Figure lb is a restriction map of RNA 2 (SEQ ID No. 47) ' 
clones, 

15 Figure 2 is the complete sequence of RNA 1 (SEQ ID 
No. 39) and of major encoded polypeptide. 
Figure 3 a is the complete sequence of RNA 2 (SEQ ID 
No. 47) in the 

authentic version, and its encoded polypeptides. 
20 Figure 3b is the sequence of RNA 2 variant (a 5C 
version) (SEQ ID No. 51) and its major encoded 
polypeptide(s). 

Figure 4 is bioassay data showing HaSV induced 
stunting of larvae. 
25 Figure 5 is a map of Vector plasmid pT7T2b and 
PT7T2c. 

Figure 6 is a schematic representation of the proteins 
encoded by RNA 1 

(SEQ ID No. 39) and RNA 2 (SEQ ID No. 47). 
30 Figure 7 is a schematic representation of the proteins 

expressed by RNA 2 (SEQ ID No. 47) in bacteria DNA 



fragments encoding P17 (SEQ ID No. 48), P71 (SEQ ID 
No. 50), P64, P7 and the fusion protein P70 (SEQ ID No. 
52) were synthesized by PCR. The flanking Ndel and 
BamHI sites used in 

cloning are indicated. (Note that P17 is followed by 
BgUI site, whose 

cohesive ends are compatible with those of BamHI). 



Figure 8 illustrates the 3 '-terminal secondary structure of 

HaSV RNAs. The tRNA-like structures at the 3' ends of 

RNAs 1 and 2 (SEQ ID No. 39 & 

47) are shown. Residues in bold are common to both 

sequences. 

Figure 9 Expression strategies for HaSV cDNAs in insect 
cells. The upper 

part of the figure shows the genome organization of 
RNAs 1 and 2 (SEQ ID Nos. 39 & 47). The lower part 
shows insertion of cDNAs corresponding to these RNAs 
into a plasmid vector, between heat shock protein 
(HSP70) promoter of Drosophila and a suitable 
polyadenylation (pA) signal. The 
HSP promoter was obtained by PCR using suitable 
primers, with a BamHI 

site inserted by PCR immediately upstream of the start of 
the transcription, giving the following sequence: 
GGATCCACAGnnn (SEQ ED No. 1), where the 
underlined residue is the transcription start site for either 
RNA. The cDNAs are terminated by Clal sites, allowing 
direct linkage to ribozyme sequences as described in the 
text. 

Figure 10 Ribozymes to yield correct 3' ends. The 
sequences of ribozymes inserted as short cDNA 
fragments into HaSV cDNA clones are shown. The 
ribozyme fragments were assembled and cloned as 
described in the text. Designed self-cleavage points are 
indicated by bold arrows. 

Figure 1 1 Immunoblots to map epitopes on HaSV. A. 
Detected with HaSV antiserum. Lane 1: pTP70delSP; 
lane 2; pTP70; lane 3: pTP17; lane 4: 



control; lane 5: pTP70delN; lane 6: pTP70; lane 7: 

pTP71; lane 8: HaSV virions; lane 9: molecular weight 

markers. B. Detected with HaSV 

antiserum. Lane 1: pTP70delN; lane 2: pTP70 delSPN; 

lane 3 : pTP70. C. Detected with an antiserum to the Bt 

toxin (CrylA(c)). Lane 1: pTP70; lane 

2: HaSV virions; lane 3: control extract. 

Figure 12 New field isolates of HaSV. The genomic 

organization of RNA 2 

is shown at the top of the Figure. PCR using appropriate 
primers with 

BamHI restriction sites and in some cases altered context 
sequences of the 
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AUG initiating translation of the P17 (SEQ ID No. 48) or 
P71 (SEQ ID No. 50) genes were used to make 
fragments for cloning into the BamHI sites of 
the expression vectors. Constructs 17E71 (SEQ ID No. 
5 35)andP71 (SEQ 

ID No. 50) have altered context sequences of the AUG 
initiating translation 

of the P17 (SEQ ID No. 48) and P71 (SEQ ID No. 50) 
genes respectively; these alterations correspond to the 
10 context derived from the JHE gene (see text). All 

context sequences are given on the right of the Figure. 
R2isa 

clone of the complete RNA sequence as a BamHI 
fragment in the vector. 
15 Figure 13 Maps of the expression constructs in 
baculovirus vectors. 

Figure 14a to e Various strategies utilizing the present 
invention. 

Figure 15 Expression of RNAs 1 and 2 (SEQ ID Nos. 39 
20 and 47) from baculovirus vectors. The full length cDNA 
clone of HaSV RNA 1 or 2 
(SEQ ID Nos. 39 & 47) was inserted as a BamHI 
fragment into the baculoexpression vectors. PCR. was 
used to add BamHI sites immediately adjacent to the 5' 
25 and 3' termini of the RNA 1 sequence; sequences of the 
primers are given in the text. Constructs R1RZ and 
R2RZ carry cis-acting ribozymes immediately adjacent to 
the 3' end of the sequence of RNA 1 
and 2 (SEQ ID Nos. 39 & 47) respectively. 
30 Figure 16 Expression strategies for HaSV cDNAs in 
plant cells. The upper part of the Figure shows the 
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genome organization of RNAs 1 and 2 (SEQ ID Nos. 39 
& 47). The lower part shows insertion of cDNAs 
corresponding to these RNAs into a plasmid vector, 
between 35S promoter of cauliflower mosaic virus and 
5 the polyadenylation (p A) signal on plasmid pDH5 1 
(Pietrzak et al, 1986). The cDNAs were obtained by 
PCR using suitable 

primers, with a BaMHI site immediately upstream of the 
start of each cDNA The cDNAs are terminated by Clal 
10 sites, allowing 

direct linkage to ribozyme sequences as described in the 
text. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

A first aspect of the invention contemplates use of small RNA viruses for biological 
control of insects. In particular, in accordance with the first aspect of this invention 
there is provided an isolated small RNA virus, particularly K armigera stunt virus or 
5 mutants, variants or derivatives thereof capable of infecting insects, in particular the 
insect species such as Helicoverpa armigera. The small RNA virus isolate of the 
instant invention is insecticidal and in particular stunts the growth of insect larvae, for 
example Helicoverpa armigera larvae and inhibits or prevents development into the 
adult stage. 

10 

The small RNA viruses of the instant invention have insecticidal, anti-feeding, gut- 
binding or any synergistic property or other activity useful for insect control. 

In particular, Helicoverpa armigera stunt virus (HaSV) particles are isometric and 
15 approximately 36 nm in diameter with a buoyant density on CsCl gradients of 

1.36g/ml. The virus is composed of two major capsid proteins of approximately 64 
and 7 KDa in size as determined on SDS-PAGE. The HaSV genome is much later 
than the largest known nodavirus (another class of RNA viruses) and comprises two ss 
(+) RNA molecules of approximately 5,3 and 2.4 kb. The genome appears to lack a 
20 blockage of unknown structure at the 3' termini that is found in Nodaviridae. The 
HaSV genome however shares a capped structure and non-poly adenylation with 
Nodaviridae. HaSV differs significantly from Nodaviridae and Nudaurelia w virus in 
terms of its immunological properties. In particular the large capsid protein has 
different antigenic determinants. Other properties of HaSV are described in the 
25 Examples. 
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The host range of HaSV includes Lepidopterans such as from the subfamily 
Heliothinae. Species known to be hosts are Helicoverpa (Heliothis) armigera, 
H. punctigera, H. zea, Heliothis virescens and other such noctuides as Spodoptera 
exigua. K armigera which is known by the common names corn ear worm, cotton 
5 ball worm, tomato grub and tobacco bud worm is a pest of economic significance in 
most countries. H.punctigera, the native bud worm, is a pests of the great economic 
significance in Australia. Members of the Heliothinae, which include Helicoverpa and 
Heliothis, and especially H. armigera are among the most important and widespread 
pests in the world. In the US Heliothis virescens and Helicoverpa zea are particularly 
10 important pests. 

The first aspect of the invention provides an isolated small RNA virus capable of 
infecting insects including Heliothis species. In a particularly preferred form the 
invention relates to mutants, variants and derivatives of HaSV. The terms "mutant", 

15 "variant and "derivative" include all naturally occurring and artificially created viruses 
or viral components which differ from the HaSV isolate as herein described in 
nucleotide content or sequence, amino acid content or sequence, immunological 
reactivity, non-glycosylation or glycosylation pattern and/or infectivity but generally 
retain insecticidal activity. Specifically the terms "mutant", "variant" and "derivative" 

20 of HaSV covers small RNA viruses which have one or more functional characteristic 
of HaSV described herein. Examples of mutants, variants or derivatives of HaSV 
include small RNA viruses that have different nucleic or amino acid sequences from 
HaSV but retain one of more functional features of HaSV. These may include strains 
with genetically silent substitutions, strains carrying replication and encapsidation 

25 sequences and signals that are functionally related to HaSV, or strains that carry 
functionally related protein domains. 

In a preferred aspect the invention relates to mutants, variants or derivatives 2of HaSV 
which encode replication or encapsidation sequences, structures or signals with 60%, 
30 preferably 70%, more preferably 80%, still more preferably 90% and even more 
preferably 95% nucleotide sequence-identity to the nucleotide sequences HaSV. 



In another preferred aspect the invention relates to mutants, variants or derivatives of 
HaSV which encode proteins with at least 50%, preferably 60%, preferably 70%, more 
preferably 80%, still more preferably 90% and even more preferably 95% amino acid 
sequence identity to proteins or polypeptides of HaSV. 

5 

In another preferred aspect the invention relates to mutants, variants or derivatives of 
HaSV with 50%, more preferably 60%, still more preferably 70%, more preferably 
80%, still more preferably 90 or 95% nucleotide sequence identity to the following 
biologically active domains encoded by the HaSV genome: 
10 RNA 1 (SEQ ID No: 39) - amino acid residues 401 to 600 or the other 

domains in the replicase 
RNA 2 (SEQ ID No: 47) (in the capsid protein) 

amino acid residues 273 to 435 
amino acid residues 50 to 272 
1 5 - amino acid residues 436 to the COOH terminus 

Preferably the viral isolate of the present invention is biologically pure which means a 
preparation of the virus comprising at least 20% relative to other components as 
determined by weight, viral activity or any other convenient means. More preferably 
20 the isolates are 50% pure, still more preferably it is 60%, even more preferablyit is 
70% pure, still more preferably it is 80% pure and even more preferably it is 90% or 
more, pure. 

In a second aspect the present invention relates to a nucleotide sequence or sequences 
25 hybridizable with those of HaSV. The term nucleotide sequence used herein includes 
RNA, DNA, cDNA and nucleotide sequences complementary thereto. Such 
nucleotide sequences also include single or double stranded nucleic acid molecules and 
linear and covalently closed circular molecules. The nucleic acid sequences may be 
the same as the HaSV sequences as herein described or may contain single or multiple 
30 nucleotide substitutions and/or deletions and/or additions thereto. The term nucleotide 
sequence also includes sequences with sufficient homology to hybridize with the 
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nucleotide sequence under low, preferably medium and most preferably high 
stringency conditions (Sambrook J, Fritsch, E.F. & Maniatis T. (1989) Molecular 
Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbour Laboratories Press) 
and to nucleotide sequences encoding functionally equivalent sequences. In still a 
5 more preferred embodiment the invention comprises the nucleotide sequences of 

genome components 1 and 2 (SEQ ID Nos: 39 and 47) as represented by Figures 1 and 
2 hereinafter or parts thereof, or mutants, variants, or derivatives thereof The terms 
"mutants", "variants" or "derivatives" of nucleotide genome components 1 and 2 (SEQ 
ID Nos: 39 and 47) has the same meaning, when applied to nucleotide sequences as 
10 that given above and includes parts of genome components 1 and 2 (SEQ ID Nos: 39 
and 47). 

The second aspect of the invention also relates to nucleotide signals, sequences or 
structures which enable the nucleic acid on which they are present to be replicated by 
15 HaSV replicase. Furthermore the invention relates to the nucleotide signals, sequences 
or structures which enable nucleic acids on which they are present to be encapsidated. 

In a particularly preferred embodiment of the second aspect, the invention comprises 
nucleotide sequences which are mutants of the capsid gene having the following 
20 sequences: 

ATG GGC GAT GCC GGC GTC GCGT TCA CAG (SEQ ID No: 2) 
ATG GAG GAT GCT GGA GTG GCG TCA CAG (SEQ ID No: 3) 
ATG AGC GAG GCC GGC GTC GCG TCA CAG (SEQ ID No: 4) 

25 In a preferred aspect the invention relates to nucleotide sequences of HaSV encoding 
insecticidal activity including the capsid protein gene and P17 (SEQ ID No: 48) and 
mutants, variants and derivatives thereof. 

In another preferred aspect the invention comprises nucleotide sequences including the 
30 following ribozyme oligonucleotides: 
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5'CCATCGATGCCGGACTGGTATCCCAGGGGG (called "HVRlCla" herein) (SEQ 
ID No: 5) 

5' CCATCGATGCCGGACTGGTATCCCGAGGGAC (called "5'HVR2Cla" herein) 
5 (SEQ ID No: 6) 

5' CCATCGATGATCCAGCCTCCTCGCGGCGCCGGATGGGCA (called 
"RZHDV1" herein) (SEQ ID No: 7) 

10 5' GCTCTAGATCCATTCGCCATCCGAAGATGCCCATCCGGC (called 
"RZHDV2" herein) (SEQ ID No: 8) 

5' CCATCGATTTATGCCGAGAAGGTAACCAGAGAAACACAC (called 
"RZHCl" herein) (SEQ ID No: 9) 

15 

5' GCTCTAGACCAGGTAATATACCACAACGTGTGTTTCTCT (called "RZHC2" 
herein) (SEQ ID No: 10) 

Ribozyme sequences are useful for obtaining translation, replication and encapsidation 
20 of the transcript. It is therefore desirable to cleave the transcript downstream of its t- 
RNA-like structure or poly A tail prior to translation, replication or encapsidation of 
the transcript. 

The present invention also further extends to oligonucleotide primers for the above 
25 sequences, antisense sequences and nucleotide probes for the above sequences and 
homologues and analogues of said primers, antisense sequences and probes. Such 
primers and probes are useful in the identification, isolation and/or cloning of genes 
encoding insecticidally effective proteins or proteins required for viral activity, from 
HaSV or another virus (whether related or unrelated) carrying a similar gene or similar 
30 RNA sequence. They are also useful in screening for HaSV or other viruses in the 
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field or in identifying HaSV or other viruses in insects, especially in order to identify 
related viruses capable of causing pathogenecity similar to HaSV. 

Any pair of oligonucleotide primers derived from either RNA 1 or RNA 2 (SEQ ID 
5 Nos: 39 and 47) and located between ca 300 and 1500 bp apart can be used as primers. 
The following pairs of primer sequences exemplify particularly preferred embodiments 
of the present invention: Specifically for RNA 1 (SEQ ID No: 39): 
1 . HVR1B5* (SEQ ID No: 38) (described below) and the primer complementary 
to nucleotides 1192-1212 of Figure 1. 
10 2. The primer corresponding to nucleotides 4084 and 4100 of Fig. 13 and the 
primer HVR13p (SEQ ID No: 12) described below 

Specifically for RNA 2 (SEQ ID No: 47): ' 

1 . The primer corresponding to nucleotides 459 to 476 of Fig. 2 and the primer 
15 complementary to nucleotides 1653 to 1669 of Fig. 2 (this would include the central 

variable domain) 

2. R2cdha5 and the primer complementary to nucleotides 1 156 to 1 172 of Fig. 2 

3. The primer corresponding to nucleotides 1 178 to 1 194 and the primer 
complementary to nucleotides 2072 to 2091 (of Fig, 2). 

20 Other combinations giving shorter fragments are also possible. 

Further preferred primers include: 

5' GGGGGGAATTCATTTAGGTGACACTATAGTTCTGCCTCCCCGGAC (called 
25 "HvRlSP5p M herein) (SEQ ID No: 1 1) 

5' GGGGGGATCCTGGTATCCCAGGGGGGC (called M HvR13p n herein) (SEQ ID 
No: 12) 

30 5' CCGGAAGCTTGTTTTTCTTTCTTTACCA (called- n Hr2cdna5 " herein) 
(SEQ ED No: 13) 
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5' GGGGGATCCGATGGTATCCCGAGGGACGC 
TCAGCAGGTGGCATAGG (called "HvR23p" herein) (SEQ ID No: 14) 

AAATAATTTTGTTACTTTAGAAGGAGATATACAT ATGAGCGAGCGAGCACA 
5 C (called "HVPET65N" herein) (SEQ ID No: 15) 

AAATAATTTTGTTTAACCTTAAGAAGGAGATCTACAT ATGCTGGAGTGGCG 
TCAC (called "HVPET63N" herein) (SEQ ID No: 16) 

10 GGAGATCTACAT ATGGGAGATGCTGGAGTG (called "HVPET64N" herein) 
(SEQ ID No: 17) 

GTAGCGAACGTCGAGAA (called "HVRNA2F3 " herein) (SEQ ID No: 18) 

1 5 GGGGGATCCTC AGTTGTCAGTGGCGGGGTAG (called "HVP65C" herein) (SEQ 
ID No: 19) 

GGGGATCC CTAATTGGCACGAGCGGCGC (called "HVP6C2" herein) (SEQ ID 
No: 20) 

20 

AATTACATATGGCGGCCGCCGTTTCTGCC (called "HVP6MA" herein) 
(SEQ ID No: 21) 
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AATTACATATGTTCGCGGCCGCCGTTTCT (called "HVP6MF' herein) 
(SEQIDNo: 22) 

The invention also relates to vectors encoding the nucleotide sequence described above 
5 and to host cells including the same. Preferably these vectors are capaible of 
expression in animal, plant or bacterial cell or are capable of transferring the 
sequences of the present invention to the genome of other organisms such as plants. 
More preferably they are capable of expression in insect and crop plant cells. 

10 In a preferred aspect the invention relates to the vectors pDHVRl, pDHVRlRZ, 

pDHVR2, pDHVR2RZ, pl7V71, pl7E71, pPH, pV71, pl7V64, pl7E64, pP64, pV64, 
pBacHVRl, pBacHVRlRZ, pBacHUR2, pBacHVR2RZ, pHSPRl, pHSPRlRZ, 
pHSPR2, pHSPR2RZ, pSRl(E3)A, pSRl(E3)B, pSR2A, pSR2B, pSX2P70, 
pSXR2P70, pSRP2B, pBHVRIB, pBHVR2B, pT7T2P64, pSR2P70, pT7T2P65, 

15 pT7T2P70, pT7T2-P71, pBSKSE3, pBSR15 ? pBSR25p ? pSR25, phr236P70, 
phr235P65 ? P GemP63N, P GemP64N ? pGemP65N ? pP64N, pP65H, pTP6MA ? 
pTP6MF ? pTP17, pTP17delBB ; pP656 or p70G as described hereinafter. 

In a third aspect the invention relates to polypeptides or proteins encoded by HaSV 
20 and to homologues and analogues thereof. This aspect of the invention also relates to 
derivatives and variants of the polypeptides and proteins of HaSV. Such derivatives 
and variants include substitutions and/or deletions of one or more amino acids, and 
amino and carboxy terminal fusions with other polypeptides or proteins. In a preferred 
aspect the invention relates to the proteins P7, PI 6, P17 (SEQ ID No: 48), P64, P70 
25 (SEQ ID No: 52), P71 (SEQ ID No: 50), PI la (SEQ ID No: 42), PI lb (SEQ ID No: 
44), P14 (SEQ ID No: 46) and PI 87 (SEQ ID No: 40) described herein and to 
homologues and analogues thereof, including fusion proteins particularly of P71 (SEQ 
ID No: 50) such as P70 (SEQ ID No: 52) described herein. In a most preferred aspect 
the invention relates to polypeptides or proteins from HaSV which have insecticidal 
30 activity themselves or provide target specificity for insecticidal agents. In particular 
the invention relates to polypeptidesror fragments thereof with insect gut binding 
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specificity, particularly to the variable domains thereof as herein described. In 
addition, homologues and analogues with said insecticidal activity of the polypeptides 
and proteins are also included within the scope of the invention. In addition the 
invention also relates to antibodies (such as monoclonal or polyclonal antibodies or 
chimeric antibodies including phage antibodies produced in bacteria) specific for said 
polypeptide and protein sequences. Such antibodies are useful in detecting HaSV and 
related viruses or the protein products thereof 

In a fourth aspect the invention provides an infectious, recombinant insect virus 
including a vector, an expressible nucleic acid sequence comprising all of, or a portion 
of the HaSV genome, including an insecticidally effective portion of the genome and 
optionally, material derived from another insect virus species or isolate(s). 

Insect virus vectors suitable for the invention according to this aspect, include 
baculoviruses, entomopoxviruses and cytoplasmic polyhedrosis viruses. Most 
preferably, the insect virus vector is selected from the group comprising the 
baculovirus genera of nuclear polyhedrosis viruses (NPV's) and granulosis viruses 
(GV's). In this aspect of the invention the vector acts as a carrier for the HaSV genes 
encoding insectidical activity. The recombinant insect virus vector may be grown by 
either established procedures Shieh, (1989), Vlak (in press) or any other suitable 
procedure and the virus disseminated as needed. The insect virus vectors may be those 
described in copending International application No. PCT/AU92/00413. 

The nucleic acid sequence or sequences incorporated into the recombinant vector may 
be a cDNA, DNA or DNA sequence and may comprise the genome or portion thereof 
of a DNA or RNA of HaSV or another species. The term "material derived from 
another insect virus species or isolate" includes any nucleic acid sequence, or protein 
sequence or parts thereof which are useful in exerting an insecticidal effect when 
incorporated in the recombinant vector of the invention. Suitable nucleic acid 
sequences for incorporation into the recombinant vector include insecticidally 
effective agents such as a neurotoxin from the mite Pyemotes tritici (Tomalski, MD. 
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& Miller, L.K. Nature 352, 82-85 (1991) a toxin component of the venom of the North 
African scorpion Androctonus australia Maeda, S. et al. Virology 184-777-780 (1991) 
Stewart, L.M.D. et al, Nature 352, 85-88 (1991), Conotoxins from the venom of 
Conus spp. (Olivera B.M. et al, Science 249, 257-263 (1990); Woodward S.R. et al., 
5 EMBO J. 9, 1015-1020 (1990); Olivera B.M. et al, Eur. J. Biochem. 202, 589-595 
(1991). 

The exogenous nucleic acid sequence may be operably placed into the insect virus 
vector between a viral or cellular promoter and a polyadenylation signal Upon 
10 infection of an insect cell, the vector virus will cause the production of either 
infectious virus genomic RNA or infectious encapsidated viral particles. 

T \ The promoters may be constitutively expressed or inducible. These include'tissue 
I: 1 specific promoters, temperature sensitive promoters or promoters which are activated 
|J|5 when the insect feeds on a metabolite in the plant that it is desired to protect. 

H Recombinant insect virus vectors according to the present invention may include 
□ nucleic acid sequences comprising all or an infectious or insecticidally effective 
br portion of genome the HaSV and optionally another insect virus species or isolate. 
30 

In a particularly preferred embodiment of the present invention there is provided 
assembled capsids comprising one or more of the capsid proteins of the present 
invention, or derivatives or variants thereof as contemplated or described herein. 
These assembled virus capsids are useful as vectors for insecticidal agents. As such 

25 the assembled viral capsids may be used to administer insecticidal agents such as 

various nucleotide sequences with insecticidal activity or various toxins to an insect. 
Nucleotide sequences in the form of RNA or DNA which can be used include those of 
the HaSV genome or other insect viruses. Toxins which can be used advantageously 
include those which are active intracellularly and may also include neurotoxins with an 

30 appropriate transportation mechanism to reach the insect neurones. 
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The efficacy or insecticidal activity of infectious genomic RNA or viral particles 
produced by insect cells infected with insect vectors according to this aspect of the 
invention, may be enhanced as described below. Moreover the virus vector itself may 
include within a non- essential region(s), one or more nucleic acid sequences encoding 
5 substances that are deleterious to insects such as the insecticidally effective agents 
described above. Alternatively an extra genome component may be added to the 
HaSV genome either by insertion into one of the HaSV genes or by adding it to the 
ends of the genome. 

10 In a particularly preferred embodiment there is provided a recombinant baculovirus 
vector comprising HaSV or part thereof having insecticidal properties. 

Other modifications which may be made to the infectious recombinant insect virus 
according to the fourth aspect include: 

15 

i) splitting the exogenous HaSV nucleic acid molecules comprising the genome 

and cloning the fragments into the insect vector so that they cannot rejoin. One 
component, preferably the virus RNA replicase, could be expressed from a 
separately-transcribed fragment, the transcripts of which would not be 

20 replicated by the replicase they encode. The remainder of the genome (having 

insecticidal activity or encoding the capsid protein or a separate toxin m-RNA) 
could be encoded by, for example, a second separately-transcribed fragment, 
the transcripts of which are capable of being amplified by the replicase. 
Consequently, whilst the transcripts from the second or other fragment would 

25 effect their insecticidal activity upon the infected insect cell, they would not be 

able to infect another insect cell, (even if encapsidated) because the replicase 
or replicase-encoding transcripts would be absent; 

This modification would allow an inherent biological containment to be built 
30 into the insecticidal vectors, which, when used in conjunction with the use of 

non-persistent DNA virus vectors such as those described in the above 
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mentioned copending application, would allow a new level of environmental 
safety greatly extending earlier approaches based on baculovirus vectors. 

ii) Manipulation of encapsidation signals or sequences essential for replicase 
5 binding or production of sub-genomic mRNA's including expression of 

exogeneous insect control factors as RNAs dependent on the virus for 
replication. This involves determination of RNA sequences and signals 
important for replication and encapsidation of virus RNAs, such as by analysis 
of replication of deletion mutants carrying reporter genes in appropriate cells, 
10 followed by studies on the transmission of the reporter gene to larvae by 

feeding of virus. These deletion mutants can be used to carry genes for insect 
control factors/toxins to larvae after replacing the reporter gene by a suitable 
toxin gene such as shown in Fig. 12; ' 

1 5 iii) using an insect promoter responsive to virus infection and, for example, placing 
copies of the viral replicase gene under the control of two promoters, one 
which is constitutive or expressed at early stages of vector infection, and the 
other being a cellular promoter turned on by the ensuing RNA viral infection. 
This system would then make more copies of the replicase mRNA available as 

20 the amount of its template increased. Such a promoter may be isolated using 

techniques analogous to enhancer trapping, that is, transforming insect cells 
with a suitable reporter gene and looking for induction of the reporter upon 
virus infection of a population of transformed cells. 

25 In a fifth aspect the invention relates to a method of controlling insect attack in plants 
by genetically manipulating plants to express HaSV or parts thereof which can confer 
insecticidal activity optionally in combination with other insecticidally effective 
agents. Such plants are referred to as transgenic plants. 

30 The term "express" should be understood as referring to the process of transcribing the 
genome or portion thereof into RNA or, alternatively, the process of transcribing the 
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genome or portion thereof into RNA and then, in turn, translating the RNA into a 
protein or peptide. 

In a sixth aspect the invention relates to the transgenic plants per se as described 
above. Transgenic plants according to the invention may be prepared for example by 
introducing a DNA construct including a cDNA or DNA fragment encoding all or a 
desired infectious portion of HaSV, into the genome of a plant. The cDNA or DNA 
fragment may, preferably, be operably placed between a plant promoter and a 
polyadenylation signal Promoters may cause constitutive or inducible expressi6n of 
the sequences under their control. Furthermore they may be specific to certain tissues, 
such as the leaves of a plant where insect attack occurs but not to other parts of the 
plant such as that used for food. The inducible promoters may be induced by stimuli 
such as disturbance of wind or insect movement on the plant's tissues, or may be 
specifically turned on by insect damage to plant tissues. Heat may also be a stimulus 
for promoter induction such as in spring where temperatures increase and likelihood of 
insect attack also increases. Other stimuli such as spraying by a chemical (for 
instances a harmless chemical) may induce the promoter. 

The cDNA or DNA fragment may encode all or a desired infectious portion of the 
wild-type, recombinant or otherwise mutated HaSV. For example, deletion mutants 
could be used which lack segments of the viral genome which are non-essential for 
replication or perhaps pathogenicity. 

The nucleotide sequences of the invention can be inserted into a plant genome by 
already established techniques, for example by an Agrobacterium transfer system or by 
electroporation. 

Plants which may be used in this aspect of the invention include plants of both 
economic and scientific interest. Such plants may be those in general which need 
protection against the insect pests discussed herein and in particular include tomato, 
potato, corn, cotton, field pea and tobacco. 
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To enhance the efficacy of infectious genomic RNA or viral particles expressed by 
transgenic plants according to the invention, the DNA construct introduced into the 
plants' genome may be engineered to include one or more exogenous nucleic acid 
sequences encoding substances that are deleterious to insects. Such substances 
5 include, for example, Bacillus thuringiensis d-toxin, insect neurohormones, 

insecticidal compounds form wasp or scorpion venom or of heterologous origin, or 
factors designed to attack and kill infected cells in such a way so as to cause 
pathogenesis in the infected tissue (for example, a ribozyme targeted against an 
essential cellular function). 

10 

DNA constructs may also be provided which include: 

i) mechanisms for regulating pathogen expression (for example, mechanisms 
which restrict the expression of ribozymes to the insect cells) by tying for 

15 example, expression to abundant virus replication, production of minus-strand 

RNA or sub-genomic mRNA's; and/or 

ii) mechanisms similar to, or analogous to, those described in copending 
International patent application number PCT/AU92/00413 so as to achieve a 

20 limited- spread system (such as control of replication). 

Transgenic plants according to the present invention may also be capable of expressing 
all or an infectious or insecticidal portion of genomes from HaSV and one or more 
species or isolates of insect viruses, 

25 

In a seventh aspect of the invention HaSV, or insecticidally effective parts thereof, or 
the infectious recombinant virus vectors of the fourth aspect of the present invention 
may be applied directly to the plant to control insect attack. HaSV or the recombinant 
virus vectors may be produced either in whole or in part in either whole insects or in 
30 culture cells of insects or in bacteria or in yeast or in some other expression system, 

HaSV or the recombinant virus forms may be applied in a crude form, semi purified or 
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purified form optionally in admixture with agriculturally acceptable carrier to the crop 
in need of protection. HaSV may also be applied as a facilitator of infection where 
existing insect populations already infected with another agent, such as one or more 
other viruses whereby HaSV is able to act synergistically to bring about an insecticidal 
5 effect. Alternatively HaSV and another agent such as one or more viruses may be 
applied together to plants to control insects feeding thereon. 

A deposit of HaSV No. 18.4 was made on August 5th 1992 at the Australian 
Government Analytical Laboratories. The deposit was given accession No. 
10 N92/35575. ' 

EXAMPLE 1 

TAXANOMIC, PHYSIOCHEMICAL 
AND BIOCHEMICAL CHARACTERISATION 
15 OF AN INSECT VIRUS: HaSV 

Materials and Methods 

A Animals and virus production. H. Armigara larvae were raised as described 
in Teakle R.E. and Jensen J.M. (1985) Heliothis punctiger in Singh P and 
Moore R.F. (eds) Handbook of Insect Rearing Vol 2., Elsevier, Amsterdam pp 

20 3 13-322. Larvae were infected for virus production by feeding five day old 

larvae on lOmg pieces of diet to which 0.064 OD 260 units of HaSV had been 
applied. After 24 hours the larvae were then transferred to covered 12-well 
plates (BioScientific, Sydney, Australia) that contained sufficient diet and 
grown for eight days after which they were collected and frozen at -80 °C until 

25 further processed. Frozen larvae were weighed to lOOg, placed into 200ml of 

50mM Tris buffer (pH 7.4), homogenized, and filtered through four layers of 
muslin. This homogenate was centrifuged in a Sorvall SS-34 rotor at 10,000 x 
g for 30 minutes whereupon the supernatant was transferred to fresh tubes and 
recentrifuged in Beckman SW-28 rotor at 100K xg for 3 hours. The resultant 

30 band was collected and repelleted in 50 mM pH 7.2 Tris buffer in a Beckman 

SW-28 tube by centrifugation at 100K xg for 3 hours. The pelleted virus was 
resuspended overnight in 1ml of buffer at 4°C then layered onto a 



26 



discontinuous CsCl gradient containing equal volumes of 60% and 30% CsCl 
(w/v) in a Beckman SW-41 tube and centrifuged at 12 h at 200 xg. The 
resultant pellet was suspended in 100ml of buffer and frozen for further use. 

Particle characterization. Staining with acridine orange was as described in 
Mayor ELD. and Hill N.O. (1961) Virology 14: p264. Buoyant density was 
estimated in CsCl gradients according to Scotti P.D., Longworth J.R, Plus N, 
Crozier G. and Reignanum C. (1981) Advances in Virus Research 26: 1 17-143. 

Immunological procedure. Rabbit anti-sera to HaSV was produced by 
standard immunological procedures. Rabbit antisera to the Nudaurelia o virus 
in addition to the virus itself was provided by Don Hendry (Rhodes University, 
Grahamstown, South Africa). Rabbit antisera to the Nudaurelia 6 vims was 
supplied by the late Carl Reinganum (Plant Research Institute, Burnley, Vic, 
Australia). The immunological relationship to the Nudaurelia w virus was 
determined by the standard reciprocal double diffusion technique. 
Immunoblotting was performed according to Towbin H., Staeheln T. and 
Gordon J. (1979) PNAS. Antibodies monospecific for the major 65 kDa capsid 
protein were prepared by incubating polyclonal antisera with sections of 
nitrocellulose blotted with the 65 kDa protein. After extensive washing in Tris . 
buffered saline, the bound antibodies were eluted in 50mM citric buffer, pH 8.0 
after a 5 minute incubation. 

Protein characterization. Polyacrylamide gel electrophoresis in the presence 
of SDS followed the procedure of Laemmli UK 1970 Nature 227; 680-685 and 
was done with 12.5% gels unless otherwise noted with low and high molecular 
weight standards from BioRad. Staining was done with a colloidal preparation 
of Coomassie Blue G-250 (Gradipore Ltd, Pyrmont, New South Wales, 
Australia). Determination of the of the smallest protein was done with a 
16% gel and standards of 3.4 kDa, 12.5 kDa and 21.5 kDa (Boehringer 
Mannheim). Glycosylation of the viral proteins was determined by a general 
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glycan staining procedure with reagents supplied by Boehringer Mannheim; the 
positive control was fetuin. N-termini of proteins were sequenced using 
procedures described by Matsudairia (1989) Purification of Proteins and 
Peptides by SDS-PAGE in A Practical Guide to Protein and Peptide 
5 Purification for Microsequencing ed Matsudaira P.T. Academic Press, San 

Diego pp 52-72 on an Applied Biosystems 477A gas phase sequencer. 

E Nucleic acid characterization. RNA was removed from capsids by twice 
vortexing a virus suspension with equal volumes of neutralized phenol then 
10 with phenol/chloroform (50:50). RNA was then precipitated from the aqueous 

phase in the presence of 300 mM sodium acetate and 2.5 volumes of ethanol. 
Digestions of the HaSV nucleic acid with RNAse A and DNAse I (Boehringer 
Zj Mannheim) were done with pBSSK(-) phagemid ssDNA and dsDNA 

j;j (Stratagene) and RNA controls (BRL). Denaturing agarose gel electrophoresis 

U! 15 in the presence of formaldehyde was performed according to Sambrook et al 

w (1989). The state of polyandenylation of the viral RNA was determined by two 

^ methods. The first method was to compare the binding of identical amounts 

£3 (20 mg) of viral RNA and poly(A)-selected RNA from Helicoverpa virescens 

to a 1ml slurry of Smg of oligo-d(T) cellulose (Pharmacia) in a binding buffer 
5 20 consisting of 20 mM Tris pH 7.8, 500 mM NaCl ? 1 mM EDTA and 0.04% 

SDS. The second method was to observe specific priming of viral RNA and 
viral RNA polyadenylated with poly(A) polymerase (Pharmacia) with d(T) 
16 A/C/G primers in RNA sequencing reactions using reverse transcriptase (US 
Biochemical) and a protocol provided by the supplier. The 5" cap structure of 
25 the genomic RNA and HaSV was determined by observing the ability of 

polynucleotide kinase to phosphorylate viral RNA with and without 
preincubation with tobacco acid pyrophosphatase and alkaline phosphatase 
(Promega) under conditions described by the supplier. 
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F In vitro translation of HaSV RNA. In vitro translation of HaSV RNA was 
performed with lysates of both rabbit reticulocytes and wheat germ (Promega) 
as directed by the supplier. Reactions were conducted in 10 ml volumes with 
1 .0 mg of RNA in the presence of five u Ci 35 S -methionine. The labelled 
5 proteins were resolved on 10% and 14% SDS-PAGE gels as described above 

then visualised by autoradiography of the dried gels. The two viral RNAs were 
separated by a "freeze and squeeze" method after resolution on nondenaturing 
low-melting-point agarose gels in TAE (Sambrook, et al. 1989). Briefly, 
agarose slices containing the RNA were melted at 65° C in a volume of TAE 
10 buffer equal to six times the agarose volume. The solution was allowed to gel 

on ice before freezing at -80° C for 30 minutes. The frozen solution was 
thawed on ice then centrifuged at 14,500xg for 10 minutes after which the 
supernatant was withdrawn and precipitated by the addition of ethanol. 



G Bioassay of virus-induced pathogensis 

Known amounts of virus isolate, as shown in Figure 3, were fed to 
larvae at the growth stages indicated by admixture to stadnard diet. At 
the time points shown, the larvae were weighed and the mean and SD 
20 calculated. Growth of infected larvae was compared to those of 

uninfected control populations from the same hatching batch in every 
experiment. 

Results 

25 i) Characteristics and taxonomy of HaSV 

The virus particles are isometric and are approximately 36 - 38 nm in diameter. They 
are composed of two major capsid proteins, of 65 kDa and 6kD is size. The virions 
contain two single-stranded (+) RNA species of 5.3 kb and 2.4 kb length. The virus 
bears a similarity in these respects to the Nudaurelia w virus, which has been 
30 tentatively regarded as a member of the Tetraviridae; these two viruses differ however, 
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in the above respects from other viruses in this group and are likely to form a new 
virus family, sharing chiefly their capsid structure (T=4) with the Tetraviridae. 

ii) Particle characterization and serology, 

5 The buoyant density of HaSV was calculated to be 1 .296g/ml in CsCl at pH 7.2. The 
A 260 /A 280 ratio of HaSV viral particles was 1.22 indicating a nucleic acid content of 
approximately 7% (Gibbs and Harrison, (1976) Plant Virology: The Principles 
London: Edward Arnold. Reciprocal immuno-double diffusion comparisons between 
HaSV and the Nudaurelia w virus showed no serological relationship. The morb 

10 sensitive technique of immunoblotting also showed a complete lack of any antigenic 

relationship. In addition, HaSV did not react with antisera to the Nudaurelia b virus in 
a immuno-diflEusion test or when immunoblotted. However, no Nudaurelia b virus was 
available as a positive control in these latter two immunological experiments. When 
HaSV was stained with acridine orange then irradiated with 3 lOnm UV light, the 

1 5 particles fluoresced red which indicated a single stranded genome. 

iii) Protein characterization. 

Examination of the capsid proteins of HaSV with polyacrylamide gel electrophoresis 
in the presence of SDS showed variable results depending on the quantity of protein 

20 present. At low protein loadings, two proteins in major abundance were evident that 
had Mr's of 65,000 and 6,000 along with a protein in minor abundance with Mj. of 
72,000 (data not shown). When more protein was present on the gels, however, at 
least 12 more distinct bands with M^s ranging between 15,000 and 62,000 became 
evident. Probing the resolved and blotted proteins with antibodies monospecific for 

25 the major 65 kDa capsid protein showed all but two of the proteins shared common 
antigens with the major 65 kDa protein. The major 6 kDa capsid protein and a minor 
band migrating at M = 16,000 failed to react with both the monospecific antibodies and 
untreated antisera. 
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The capsid proteins were shown to be non-glycosylated as they failed to react with a 
hydrazine analog after oxidation with periodic acid. The N-terminus of the 65 kDa 
protein appeared to be blocked in some manner as two efforts to conduct an Edman 
degradation failed. After the second attempt, the sample was treated with n- 
chlorosuccinimide and shown to be in a quantity normally adequate for sequencing. 
The N-terminus of the 6 kDa protein, however, was not blocked as an unambiguous 
16-residue sequence was readily obtained. The sequence of the N-terminus of the 6 
kDa capsid protein and those of a cyanogen bromide cleaved fragment of the 65 kDa 
protein are as follows: 

6 kDa protein: 

PheAlaAlaAlaValSerAlaPheAlaAlaAsnMetLeuSerSerValLeuLysSer 
(SEQIDNo: 23) 
65 kDa protein: 

ProThrLeuValAspGlnGlyPheTrpIleGlyGlyGlnTyrAlaLeuThrProThrSer 
(SEQ ID No: 24) 

Detailed sequence analysis of the RNA genome carried out in Example 3 showed that 
RNA 1 (SEQ ID No: 39) encodes a protein of molecular weight 186,980 hereinafter 
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referred to as P187 (SEQ ID No: 40) and RNA 2 (SEQ ID No: 47) encodes proteins 
with molecular weight 16, 522 (called P17 (SEQ ID No: 48)) and 10,610 (called P71 
(SEQ ID No: 50)). P71 (SEQ ID No: 50) is processed into two proteins of molecular 
weight 63,378 (called P64) and 7,309 (called P7). 
iv) Nucleic acid characterization 

The extracted nucleic acid from HaSV was readily hydrolysed by RNAse A but not by 
DNAse I. Denaturing agarose gel electrophoresis of the extracted RNA genome of 
HaSV indicated two strands that migrated at 5.5 kb and 2.4 kb. The RNA strands were 
shown not to have extensive regions of polyadenylation as only 24% of the viral RNA ' 
bound to the oligo-d(T) cellulose matrix as opposed to 82% of poly(A)-selected RNA. 
Further evidence for the non-poly adenylation of the viral genome was provided by the 
observation that the oligo primer, d(T) 16 G, gave a clear sequencing ladder using 
reverse transcriptase only after in vitro polyadenylation of the viral strancfs with 
poly(A)-polymerase. 

The demonstration that the strands could be modified with poly(A)-polymerase also 
showed the lack of any 3' modification. The 5' termini of the viral strands were shown 
to be capped, most likely with m 7 G(5 , )ppp(5 , )G, as they could not be labelled with 
polynucleotide kinase unless pretreated with tobacco acid pyrophosphatase and 
alkaline phosphatase. 

v) In vitro translation. 

In vitro translation of the viral RNA yielded different results in the two translation 
systems used (data not shown). The 5.5 kb RNA translated very poorly in the 
reticulocyte system whereas it produced in the wheatgerm system more than 20 
proteins ranging in size from 1^=195,000 to M= 12,000. The 2.4 kb viral RNA strand 
yielded a major protein with an 1^=24,000 in both systems in addition to a minor 
protein at M=70 kDa. A time course of the translation reaction with the 5.5 kb RNA 
strand showed all labelled proteins were produced at similar rates indicating that the 
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smaller products did not arise through processing of the larger ones. However when a 
time course experiment was done with translation of the smaller 2.4 kb RNA strand, 
the 24 kDa protein appeared before the 70 kDa protein. 

5 vi) Presence of another form of HaSV 

Frequently, during purification of HaSV virions, a minor band appeared in varying 
amounts on the CsCl gradient that had a buoyant density of 1.3 g/ml. On four 
occasions, when particles from this minor band were used to infect H. armigera larvae 
that were then processed as before for purification of HaSV virions, the HaSV band 

10 with a density of 1.296g/ml was again recovered in vast excess to a varying minor 
amount of the more dense band. No virions of either type were recovered from 
uninfected control larvae. Proteins extracted from the more dense particles appeared 
identical to those from the less dense particles when examined by SDS-PAGE and 
immunoblotting with antibodies specific for the 65 kDa capsid protein of HaSV. 

1 5 Extraction and examination of the RNA genome with denaturing agarose gel 

electrophoresis also showed the same 5.5 and 2.4 kb bands. When particles from the 
more dense band were examined by electron microscopy as before, they appeared to 
have a larger diameter 45nm but otherwise highly similar to the 38nm particles. 

20 The molar ratio of the two RNA strands was determined by quantitative densitometry 
of fluorograms of the resolved strands. The ratio derived from an average of four 
measurements of various loadings on denaturing gels proved to be 1.7:1 (5.5 kb strand: 
2.4 kb strand) which is somewhat lower than the expected ratio of 2.3:1 for equimolar 
amounts of each strand. 

25 

The genome of HaSV has major differences that make it distinct from those of the 
nodaviruses, the only other group of bipartite small RNA viruses pathogenic to 
animals. Although HaSV shares the characteristic of a bipartite genome with the only 
animal viruses having such a divided genome, the nodaviridae, it differs in virtually 
30 every other aspect from this group. Both segments of its genome are considerably 
larger than the 
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corresponding nodaviral RNAs (Hendry DA, (1991) Nodaviridae of [nvertebrates. in 
(ed. E. Kurstak) Viruses of Invertebrates. Marcel Dekker, New York, pp. 227-276). 
However, the division of genetic labour is similar with the larger component carrying 
the replicase gene and the smaller one encoding the capsid proteins. Direct 
5 comparison of the sequences shows little homology between these viruses, at either 
RNA or protein level. The Nodaviruses, have the already mentioned unusual 
3 'blockage (probably a protein), whereas the HaSV RNAs terminate in a distinctive 
secondary structure resembling a tRNA. 

10 vii) Bioassays of virus isolates on larvae 

The original constructs made to express the capsid proteins (precursor and processed 
forms) in E. coli for bioassay started at the first AUG (nts 284 to 286). Production of 
full-length, immuno-reactive protein from these was due to these clones being the 5C 
sequence version with the extra C residue. Bioassays of these proteins have been 
1 5 difficult due to problems with obtaining suitable Heliothis larvae for the tests. 

Purified native HaSV was used to conduct bioassays in non-noctuid insect species. 
The native HaSV was orally administered, the larvae scored for symptoms of infection 
and growth was measured. Dot blotting for HaSV RNA was also conducted. Based 
20 on these experiments native HaSV does not appear to infect the following larvae. 



Species 


Order 


Family 


Galleria mellonella 


Lepidoptera 


Pyradidae 


Tineola bissellia 


Lepidoptera 


Tineidae 


Epiphyas postvittana 


Lepidoptera 


Tortricidae 


Lucilia cuprina 


Diptera 


Calliphoridae 


Dacus tyronii 


Diptera 


Tephritidae 


Antitrogus parvulus 


Coleoptera 


Scarabaediae 


Lepidiota picticollis 


Coleoptera 


Scarabaediae 


Sericesthis germinata 


Coleoptera 


Scarabaediae 
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The above experiment conducted with the larvae of Spodoptera exigua and S. litura 
showed that native HaS V infects these species but not to the same degree as seen in 
Heliothis armigera, 

5 

EXAMPLE 2 
OTHER VIRUS ISOLATES 

Materials and Methods 
A Virus isolation 

10 Apparently infected (viz diseased) larvae of Helicoverpa sp were collected in February 
1993 at Mullaley (NSW), Narrabri (NSW) and Toowoomba (QLD) (Australia). 
Referring to Fig. 10 the samples in wells 2A-2D were from parasitised K armigera 
larvae collected from sorghum at Mullaley; the sample in 6C was collected from 
sunflower at Toowoomba; the sample in 7D was collected from cotton at the Narrabri 

1 5 Research Station. The latter two larvae may have been either H. armigera or H. 
punctigera, which are both easily infected with HaSV. 

B Virus RNA Extraction 

Larvae collected were ground up and RNA extracted. RNA extraction and purification 
20 were as per Example L 

C Dot-Blot Northern Hybridization 

Extracts of viral RNA was analysed by Northern dot-blot hybridisation using a probe 
made from cloned HaSV sequences derived from 3 -terminal 1000 units of RNA 1 and 
25 RNA 2 by random priming in a Boehringer Mannheim kit according to the supplier's 
instructions were employed. RNA extracts were transferred to Zeta-Probe (BioRad) 
for probing. Hybridization under high stringency washing conditions were as 
specified by BioRad. Hybridizations were carried out in the following solution: 
1 mM EDTA, 500 mM HaH 2 P0 4 , pH 7.2, 7% SDS, at 65°C in a 
30 rotating Hybaid hybridization chamber. After completion of 

hybridization and removal of the solution containing the probe, the 
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filters were washed twice in 1 mM EDTA, 40 mM HaH 2 P0 4 pH 7.2, 5% 
SDS, at 65°C (1 h each), followed by 2 washes in 1 mM EDTA, 40 mM 
HaH 2 P0 4 , pH 7.2 1% SDS, at 65°C (1 h each), before autoradiography. 

5 RESULTS 

Referring to Fig. 10, samples 9 A, 9B, 10A, 10B and 10C contain HaSV infected 
positive control lab-raised larvae; 9C-H contain healthy (HaSV-free) negative control 
lab-raised larvae; All other wells (beginning 1-8) contain extract from field-collected 
larvae. Numbers 2A-D, 6C and 7D gave positive signals indicating that these isolates 
10 are either the same as HaSV or derivatives or variants thereof. Election microscopy 
employing (-) staining confirmed that the samples which gave positive signals 
contained abundant icosohedral virus particles of approximately 36mm in size. 

The presence of HaSV in larvae which had tested positive in the Northern 
1 5 hybridization dot-blot was confirmed by Western blotting of crude extracts from such 
infected larvae, using the polyclonal antibody to the HaSV capsid protein. For routine 
screening of such extracts in order to identify further isolates of HaSV or to confirm 
the presence of the virus, use of a monoclonal antibody or its equivalent is preferable, 
in order to achieve (i) higher sensitivity of detection and (ii) greater specificity of 
20 detection. 



EXAMPLE 3 

25 IDENTIFICATION, ISOLATION AND CHARACTERISATION OF INSECT 

VIRUS GENES 

Materials and Methods 

A Animals and virus production. 

H. armigera larvae were raised as described in Example 1 . 



B Protein characterization 
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Was conducted as described in Example 1 . 

C Nucleic acid characterization 

Was conducted as in Example 1 . 

5 

D Fractionation of virus RNA 

The two viral RNAs were separated by a "freeze and squeeze" method after resolution 
on nondenaturing low melting point agarose gels in TAE (Sambrook, et al, 1989). 
Briefly, agarose slices containing the RNA were melted at 65° C in a volume of TAE 
1 0 buffer equal to six times the agarose volume. The solution was allowed to gel on ice 
before freezing it at -80° C for 30 minutes. The frozen solution was thawed on ice 
then centrifiiged at 14 ? 500g for 10 minutes after which the supernatent was withdrawn 
and precipitated by the addition of ethanol. ' 

15 E In vitro translation of HaSV RNA 

Was as in Example 1 . 

F cDNA synthesis and cloning of virus genome 

The virus RNAs were reverse transcribed into cDNA using the Superscript RTase (a 
20 modified form of the Moloney murine leukaemia virus (MMLV) RTase, produced by 
Life Technologies Inc). Oligo(dT) was used as a primer on RNA which had been 
polyadenylated in vitro. After size selection of DNA fragments over 1 kbp in length, 
the cDNA was then blunt-end ligated using T4 DNA ligase (Boehringer Mannheim or 
Promega, under conditions described by the suppliers) into vector pBSSK(-) 
25 (Stratagene) which had been cut with EcoRV and dephosphorylated with calf intestinal 
alkaline phosphatase (Boehringer Mannheim). E.coli strain JM109 or JPA101 were 
electroporated with the ligation mixture and white colonies selected on colour- 
indicator plates Sambrook et al 1989. 

30 For some clones of RNA2 (SEQ ID No: 47), cDNA was synthesised using the RTase 
of AMV (Promega) and a specific primer complementary to nucleotide sequence 2285 
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- 2301 of RNA 2 (SEQ ID No: 47). The same buffer and conditions were used for the 
Superscript RTase (above). The AMV RTase was found not to make cDNA form a 
primer annealing to the terminal 18 nucleotide sequence (see below), nor to be able to 
reach the 5-end of the RNA with the primer here described. 

5 

G Sequencing of DNA and RNA 

The cDNA clones were separated as single-stranded or double-stranded DNA, using 
the deaza-dGTP and deaza-dlTP nucleotide analogues (Pharmacia) in the deaza T7 
sequencing kit as recommended by this supplier. Synthetic oligonucleotides were used 
10 as primers. The 5' terminal sequences of the two RNAs were determined using reverse 
transcriptase to sequence the RNA template directly, from specific oligonucleotide 
primers located about 200 nucleotides downstream from the termini. Such RNA 
sequencing was performed using the reverse transcriptase sequencing kit from 
Promega, under the conditions described by the manufacturer. 

15 

The sequence of the 20 or so nucleotides at the 5' terminus of each RNA was checked 
using direct RNase digestion of 5-labelled RNA under conditions designed to confer 
sequence-specificity. Direct RNA sequence using RNases was performed with the 
RNase sequencing kit from US Biochemicals, following the protocols provided by the 
20 manufacturer. This also confirmed that the sequence of the most abundant RNA is 
consistent with that of the RNA analysed using the specific primer and RTase. 

All transcription of plasmids linearized as described were performed as recommended 
by the suppliers of SP6 RNA polymerase, in the presence of ImM cap analogue, 
25 0.2mM GTP, and 0.5mM of the other NTPs. 
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H Subcloning and expression 

PCR amplification 

The polymerase chain reaction (PCR) was used to obtain sequences covering virus 
genes in a form suitable for cloning into expression vectors. The reaction was 
5 performed with Taq DNA polymerase (Promega) as described by the supplier, in a 
rapid cycling thermal sequencer manufactured by Corbett Research (Sydney, 
Australia). A typical reaction involved 1 cycle of 1 min at 90° C, 25 cycles of 95° C (10 
sec), 50°C (20 sec), 72°C (1.5 min), followed by one cycle of 72°C for 5 min. 
Templates were generally cDNA or cDNA clones derived from HaS V RNAs, made as 
10 described below. Primers were as described below for the relevant constructs. 

Upon termination of the PCR reaction, the product's ends were made blunt by 
treatment with E.coli DNA polymerase I (Klenow fragment) at ambient temperature 
for 15 minutes. After heating at 65° C for 10 minutes, the reaction was cooled on ice 
1 5 and the reaction mix made ImM in ATP. The product then 5-phosphorylated using 5 
units of T4 polynucleotide kinase at 37° C for 30 minutes. After heating at 65° C for 
10 minutes, the product was run on a 1% low-melting agarose gel and purified as 
described for RNA in section E above. 

20 ligations: Vectors and restriction fragments cut with the enzymes described were run 
on 1% low-melting-point agarose gels and excised as slices. These slices were then 
melted at 65° C for 5 minutes, before cooling to 37° C. Fragment and vectors were 
then ligated in lOul total volume at 14° C overnight using T4DNA ligase (BRL, 
Boehringer Mannheim or Promega), in the buffers supplied by the manufacturers. 

25 

expression: Expression plasmids containing viral genes (e.g. for the capsid protein) 
were transformed into E. colt strain BL21 (DE3) or HMS174 (DE3) (supplied by 
Novagen). After growth as specified by the supplier, protein expression was induced 
by the addition of isopropyl b-D-thiogalactopyranoside (IPTG), at 0.4 nM to the 
30 growing culture for a period of 3h. Expressed proteins were analysed by SDS- 
polyacrylamide gel electrophoresis of bacterial extracts (Laemmli, 1970). 
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Results 

i) Mapping cDNA clones of HaSV 

5 The template for cDNA synthesis was virus RNA which had been polyadenylated in 
vitro. Oligo(dT) was used as a primer for the Superscript reverse transcriptase (RTase; 
a modified form of the Moloney murine leukaemia virus (MMLV) RTase, produced by 
Life Technologies Inc). The cDNA was cloned into vector pBSSK(-) as described 
earlier. The larger clones were selected for further analysis by restriction mapping and 

10 Northern hybridization. All the probes tested hybridized either to RNA 1 or to RNA 2, 
suggesting that there are no regions of extensive sequence homology between the two 
RNAs. Furthermore, screening of a number of other clones excluded the theoretical 

p I possibility that either RNA band may actually contain more than one species/ 

|| ii) RNA 1 (SEQ ID No: 39) clones 

W Three large RNA 1 (SEQ ID No: 39) clones (Bl 1U, Bl lO and B35) obtained for the 
first round of clones were further analysed by restriction mapping and shown to form 
an overlap spanning over 3 kbp (this was later confirmed by sequencing). The second 

yj round of cloning then yielded E3 of 5.3 kbp, representing 99.7% of RNA 1 (SEQ ID 
No: 39). A complete restriction map of clone E3 showed it to align with that 
previously determined for three overlapping clones. On the basis of this alignment, the 
5' end of the insert in Bl 1U was placed about 300 nucleotides downstream from the 5' 
end of the RNA. 

25 Once clones covering a contiguous block had been identified, the orientation 3 relative 
to the RNA was determined. 

iii) RNA 2 (SEQ ID No: 47) clones 

Three significant cDNA clones were isolated for RNA 2 (SEQ ID No: 47) (Fig. 2). 
30 One, hr236, contains about 88% of RNA 2 (SEQ ID No: 47) (2470 bp total length), 

and runs from the 3' end to 240 bp from the 5' end. The other clones, hr247 and hr 249 
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position 34 and terminates at nucleotide 5290 and is thought to encode the RNA- 
dependent RNA polymerase (replicase)(referred to as PI 87 (SEQ ID No: 40) in Fig. 1) 
required for virus replication, since it contains the Gly-Asp-Asp conserved triplet and 
surrounding sequences identified in these enzymes, which are usually large (over 1 00 
5 kDa), in addition to further homology with the polymerase encoded by tobacco mosaic 
virus and other plus-stranded RNA viruses. 

Referring to Fig. 1 the sequence is presented as the upper strand of the cDNA 
sequence. This strand is therefore in the same sense as the viral (positive-sense) RNA. 
10 The sequence of the protein encoded by the major open reading frame, encoding the 
putative RNA-dependent RNA replicase, is shown, as are those of the small open 
reading frames at the 3' end, corresponding to the proteins PI la (SEQ ID No: 42), 
PI lb (SEQ ID No: 44) and P14 (SEQ ID No: 46). ' 

1 5 Clone E3 was inserted downstream of the SP6 promoter for in vitro transcription. As 
mentioned above, the transcript of this clone can be translated in the wheat germ 
system to yield the 195 kDa protein observed upon translation of fractionated RNA 1 
(SEQ ID No: 39) from the virus. The latter yields more lower molecular weight 
products, presumably due to being contaminated with nicked and degraded RNA. The 

20 products derived from the in vitro transcript can therefore be regarded as defining the 
coding capacity of the complete RNA 1 (SEQ ID No: 39) of HaSV. 

vi) Sequence of genome component 2 (see Figure 2) 

The 2470 nucleotides encode a protein of molecular weight 71,000 which contains the 
25 peptide sequences corresponding to those determined from the two virus capsid 

proteins. This protein is therefore the precursor of these capsid proteins. The protein 
is a major product of in vitro translation of this RNA obtained either from virus 
particles or by in vitro transcription of a full-length cDNA clone; in addition, another 
major translation product of apparent molecular weight 24,000 is obtained. This 
30 protein is derived from a molecular weight 17,000 reading frame overlapping the slab 
of the capsid protein gene. 
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Clones hr236 and hr247 were completely sequenced as the first step in RNA 2 
sequencing. These sequences were then extensively compared to that obtained by 
direct RNA sequencing using AMV reverse transcriptase. 

5 Comparison of the cloned sequence with that by direct RNA sequencing showed both 
clones lacked 50 nucleotide present in the RNA (at around nucleotide 1500). The 
sequence of this stretch was obtained by direct RNA sequencing using the AMV 
RTase. The MMLV "Superscript" RTase, which was used to make all the cDNA 
clones, was found to simply by-pass this region in sequencing reactions. These 50 
10 nucleotides contain a very stable GC-rich hairpin flanked by a 6 bp direct repeat, and 
the MMLV RTase skips from the first repeat to the second. 

The sequence of RNA 2 (SEQ ID No: 47) was then completed using plasmids pSR2A 
and pSR2P70 constructed as described below. The plasmids contain a segment of 

15 cDNA derived for the AMV RTase, as well as the sequence corresponding to the 5' 

240 nucleotides of RNA 2 (SEQ ID No: 47) which are not present on phr236 (Fig. 2). 
The sequence of RNA in Fig. 2 is presented as the upper strand of the cDNA sequence. 
This strand is therefore in the same sense as the viral (positive-sense) RNA. The 
sequences of the proteins encoded by the major open reading frames, encoding the 

20 capsid protein precursor P71 (SEQ ID No: 50), and P17 (SEQ ID No: 48). 

The sequence of RNA 2 (SEQ ID No: 47) encodes a major ORF running from a 
methionine initiation codon at nucleotides 366 to 368 to a termination codon at 
nucleotides 2307 to 2309. This protein encoded by this ORF has a theoretical 
25 molecular weight of 71,000 (SEQ ED No: 50). This initiation codon is in a good 

context (AGGatgG), suggesting that it will be well recognized by scanning ribosomes. 
The size of the product is close to that of the residual putative precursor protein 
identified in purified virus, and to the size of the in vitro translation product obtained 
from RNA 2 (SEQ ED No: 47). 
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The approach adopted to identify the gene encoding the capsid protein was to obtain 
amino acid sequence information from the two abundant capsid proteins and then 
locate these on the protein encoded by the sequence of the virus RNAs. CNBr cleaved 
products of the capsid protein were therefore sequenced. These fragraents gave a clear 
5 and unambiguous sequence shown in Example 1 . These sequences determined were 
then located on the large ORF of RNA 2 (SEQ ID No: 47). (Figure 2) 

In the case of the small capsid protein, the clear and unambiguous sequence, obtained 
is located near the carboxy terminus of the major ORF on RNA 2 (SEQ ID No:' 47). 
10 Starting at the point corresponding to the amino-terminal residue of the sequence 
determined for the 6 kDa protein, and continuing to the carboxy-terminus of the 
complete reading frame, the protein encoded by the sequence 7.2 kDa and has a 
hydrophobic N-terminal region and an arginine rich (basic) C-terminal region. It is an 
extremely basic protein with a pi of 12.6. 

15 

The two abundant capsid proteins are derived from a single precursor, which is 
processed at a specific site. This is presumably immediately amino-terminal to the 
sequence FAAAVS.... (SEQ ID No: 25) 

20 RNA 2 (SEQ ID No: 47) appears to be a bicistronic mRNA (see Figs. 2 and 5). The 
first methionine codon is encoded on the sequence of RNA at nucleotides 283 to 285. 
This ATG is in a poor context (TTTatgA), making it a weaker initiation codon. It 
initiates a reading frame of 157 amino acids, encoding a protein of molecular weight 
17,000 (SEQ ID No: 48). (The second AUG [nts 366 to 368] initiates the 71 kDa 

25 (SEQ ID No: 50) precursor of the capsid protein). Since the first AUG is in a poor 
context, abundant expression of the capsid precursor would be expected. In fact, in 
vitro translation of a full length RNA 2 (SEQ ID No: 47) transcribed from a 
reconstructed cDNA clone yields two major protein products of relative mobility 
71,000 (SEQ ID No: 50) and 24,000, similar to those already observed upon 

30 translation of viral RNA 2 (SEQ ID No: 47). The protein of Mr 24,000 appears to 

correspond to the 157 amino acid protein, despite the significant anomaly in apparent 
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size. The 24,000 Mr product was also observed upon translation of an in vitro 
transcript covering only nucleotides 220 to 1200 of RNA 2 (SEQ ID No: 47). This 
region contains no open reading frame other than those already mentioned and cannot 
encode a protein longer than 157 amino acids. 

5 

The protein of Mr 24,000 seen upon in vitro translation appears to correspond to P17 
(SEQ ID No: 48), with the anomaly in apparent size probably being due to the high 
content of proline (P), glutamate (E), serine (S) and threonine (T). These amino acids 
cause the protein run more slowly on a gel thereby giving it an apparent size of Mr 
10 24,000. 

The Mr 24,000 protein (hereinafter referred to as P 17 (SEQ ID No: 48)) may have a 
function in modifying or manipulating the growth characteristics or cell cycle of 
HaSV-infected cells. Although a protein of 16kDa (identified in Example 1) is found 
15 in small amounts in the capsid, it does not react with antiserum against the virus 

particles this is unlikely to correspond to P17 (SEQ ID No: 48), since a preparation of 
the latter proteins migrates with a molecular weight of 24,000 on SDS gels. 

Sequence analysis of the Region from nucleotide 500 to 600 of RNA 2 (SEQ ID No: 
20 47) showed that it has the sequence shown in Fig. 2, as do the plasmids pSR2A, 

pSR2P70, pSR2B and pSXR2P70. However, plasmids pT7T72P65 and pT7T2P70 
have an extra C residue at nucleotide 570. The RNA sequence from which they are 
derived from is shown in Fig. 2 (the "5C" version). In this sequence the first ATG 
(nucleotides 283 to 285) is in the same reading frame as most of the capsid protein 
25 gene. The resultant fusion protein is called "P70" (SEQ ID No: 52) and its 

carboxyterminal-truncated version (a variant of the native P64) is "P65". In view of 
these clones it was considered important to resolve whether any virus RNA carrying 
the extra C residue was present in the viral RNA population first isolated for 
investigation. 



30 
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Direct sequencing of the virus RNA using reverse transcriptase confirmed that the 4C 
version lacking the extra residue was the abundant form of the RNA. In order to 
exclude the possibility of a small amount of the RNA having the extra residue, a 
sensitive PCR assay was designed. This showed that the extra C residue was not 
5 present on any RNA in the viral population, and had been introduced into some clones 
as a PCR artefact. These clones were however retained and used in bacterial 
expression experiments (below) because of the high level expression obtained of the 
P65 and P70 (SEQ ID No: 52) fusion proteins. 

10 vii) Comparison with the sequence of the Nudaurelia w capsid gene 

The sequence of most of the RNA2 of the Nudaurelia w virus has recently been 
published by Agrawal D.K. and Johnson IE. (Virology 190 806-81 4, 1992). From the 
published sequence it has been determined that this sequence shows 63% homology to 

15 that of HaSV RNA2 (SEQ ID No: 47) at the nucleotide level and 66% at the overall 
amino acid level. A detailed comparison of the capsid proteins of these two viruses 
shows the amino-terminal 45 residues to be variable, the next 220 residues to be highly 
conserved, the next 180 residues to be variable and the c-terminal 200 residues 
covering the small protein P7 to be highly conserved. A more detailed comparison is 

20 discussed below. 

The published report did not find a complete reading frame corresponding to the 157 
amino acid protein (PI 7 (SEQ ID No: 48)) gene reported above. The AUG is however 
present, as is a reading frame - starting upstream of the start of the capsid gene - 
25 showing considerable amino acid homology to P17 (SEQ ID No: 48) of HaSV. In 
vitro translation of purified Nudaurelia w virus RNA 2 and a re-examination of the 
nucleotide sequencing data for this RNA may help to resolve the question of whether 
the Nudaurelia w virus also encodes a protein homologous to the HaSV PI 7. 

30 More interestingly, antisera against these two viruses, which are similar at a nucleotide 
sequence level, do not show any cross-reactivity. 
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viii) Construction of full-length clones 

RNA 1 (SEQ ID No: 39) 

cDNA clone E3, described above contains all but the 5-18 nucleotides of RNA I (SEQ 
5 ID No: 39) and included the complete ORF present on the sequence. The first full- 
length clone of RNA 1 (SEQ ID No: 39) is therefore based on E3. The 4.9 kbp Xbal- 
Clal fragment from clone E3 was recloned into pBSKS(-) (Stratagene) cut with Xbal 
and Clal, giving pBSKSE3. 

10 The full-length clone of RNA 1 (SEQ ID No: 39) was completed using PCR. The 
^ primer defining the 5' end of the RNA carried an EcoRI site, the promoter for the SP6 
d RNA polymerase and a sequence corresponding to the 5' 17 nucleotides of RNA 1, as 
%a shown in Figure 1 . The sequence of this primer was: ' 
jj HvRlSPSp: 

Ul 5 5-GGGGGGAATTCATTTAGGTGACACTATAGTTCTGCCTCCCCGGAC (SEQ 
J' ID No: 11) (The G which initiates transcription is underlined) 

Using an oligonucleotide complementary to nucleotides 11 92- 1212, a PCR product of 
p 1240 bp was efficiently made. The template was cDNA synthesised using the MMLV 
2J RTase and the same oligonucleotide complementary to nucleotides 11 92- 1212 was 
ISO the primer. Upon termination of the PCR reaction, the product's ends were made blunt 
and then 5'-phophorylated as described below. The purified PCR fragment was then 
cleaved with restriction endonuclease Xbal and the 450 bp subfragment corresponding 
to the 5' end of RNA 1 (SEQ ID No: 39) cloned into the plasmid pBSSK(-)(Stragene) 
cut with EcoRV and Xbal, to give pB SRI 5 . 

25 

To assemble the full-length of RNA 1 (SEQ ID No: 39), pBSKSE3 (above) was cut 
with Xbal and Seal giving fragments of 1.2 kbp and 6.8 kbp. pBSR15 was cut with 
the same enzymes, giving fragments of 2 and 1.8 kbp. Ligation of the 6.8 kbp 
fragment for pBSKSE3 and the 1.8 kbp fragment for mpBSR15 yielded pSRl(E3)A. 
.30 Upon linearization at Clal and in vitro transcription with the SP6 RNA polymerase, 
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and RNA corresponding to RNA 1 (SEQ ID No: 39), and terminating in a poly(A) 
stretch of about 50 nucleotides, is obtained. 

Since the natural RNA 1 (SEQ ID No: 39) does not have a poly (A) tail, an alternative 
5 plasmid was constructed which carries a BamHI restriction site immediately 

downstream of the 3'end of RNA 1 (SEQ ID No: 39). Again this terminal fragment 
was made using PCR as above. The sequence of the primer was as follows: 
HvR13p: 5-GGGGGGATCCTGGTATCCCAGGGGCGC (SEQ ID No: 12) (the 
nucleotide complementary to that which was determined as the 3' one, based oh its 
10 adjacency to the poly(A) stretch, is underlined; RNA terminating at the BamHI site 
will have the sequence GCGCCCCCUGGGAUACCaggauc (SEQ ID No: 26)). 

The template was clone E3 and an oligonucleotide corresponding to nucleotides 4084 - 
4100 was the other primer. The 1220 bp product was blunt-ended, kinased and gel- 

15 purified as described above, before cleavage with Hindlll. The resulting 420 bp 
subfragment corresponding to the 3' end of RNA 1 (SEQ ID No: 39) cloned into 
plasmid pSRl(E3)A cut with Clal, end-filled with Klenow and then cut with Hindlll. 
The resulting plasmid is pSRl(E3)B. Upon linearization at BamHI and in vitro 
transcription with the SP6 RNA polymerase, and RNA corresponding to RNA 1 (SEQ 

20 ID No: 39), and terminating as described immediately above is obtained. 

ix) RNA 2 (SEQ ID No: 47) 

In constructing the full-length cDNA clone to enable in vitro transcription of this RNA 
hr236 described above was used as a basis. Two separate PCR products, one 
25 corresponding to the 5' portion of RNA 2 (SEQ ID No: 47), which is missing from this 
clone altogether, and another covering the region where clone hr236 lacks the hairpin- 
forming sequence described above, were required. 

The primer defining the 5' end of the RNA carried a Hindlll site and a sequence 
30 corresponding to the 5* 18 nucleotides of RNA 2 (SEQ ID No: 47), as shown in Figure 
2. The sequence of this primer wasr 
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Hr2cdna5: 5'-CCGGAAGCTTGTTTTTCTTTCTTTACCA (SEQ ID No: 13) 
(The nucleotide underlined corresponds to that identified as the first nucleotide of 
RNA2. (SEQ ID No: 47)) 

Using an oligonucleotide complementary to nucleotides 1653 - 1669, a PCR product of 
1 .67 kbp was made. The template was cDNA synthesised using the MMLV RTase 
and an oligonucleotide complementary to the 18 nucleotides at the 3' end of RNA 2 
(SEQ ID No: 47) as the primer. Upon termination of the PCR reaction, the product 
was blunt-ended, kinased and gel-purified as described above, before cleavage with 
PstL The resulting 1.3 kbp subfragment corresponding to the 5' half of RNA 2 (SEQ 
ID No: 47) was cloned into plasmid pBSSKQ (Stragene) cut with EcoRV and PstI, 
giving plasmid pBSR25p. In order to place this subfragment corresponding to the 5* 
half of RNA 2 (SEQ ID No: 47) downstream of the SP6 promoter for in vitro 
transcription, a 1.3 kbp Hindlll - BamHI fragment was excised from pBSR25p and 
ligated into Hindlll - BamHI cut pGEM-1 (Promega), giving plasmid pSR25. 

The second PCR product, covering the region where clone hr236 lacks the hairpin- 
forming sequence described above, was synthesised using as primers oligonucleotides 
corresponding to nucleotide sequence 873 to 889 of RNA 2 (SEQ ID No: 47) and to 
the complement of nucleotide sequence 2290 - 2309. Upon termination of the PCR 
reaction, the product was blunt-ended, kinased and gel-purified as described above, 
before cleavage with AatIL The resulting 1 . 1 kbp subfragment covering the required 
region was cloned into plasmid phr236 cut with Hindlll, end-filled with Klenow and 
cut with Aatll, giving plasmid phr236P70. 

The two segments were joined covering the first 230 nucleotides of RNA 2 (SEQ ID 
No: 47) together. Plasmid phr236P70 was cut at the Sad site in the vector adjacent to 
the 5' end of the insert and this made blunt-ended using Klenow in the absence of 
dNTPs. After heat-inactivation of the Klenow, the plasmid was cut with EcoRI, 
yielding fragments of 4.5 kbp and 380 bp. Plasmid pSR25 was cut with Nhel, blunt- 
ended by end-filling with Klenow and cut with EcoRI, yielding fragments of 2.8 kbp, 
900 bp and 750 bp. The 4.5 kbp fragment of phr236P70 and the 900 bp fragment of 
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pSR25 were ligated to give pSR2P70. This clone covers all of RNA 2 (SEQ ID No: 
47) except for the 3' 169 nucleotides. 

To complete the full-length clone of RNA 2 (SEQ ID No: 47), it was necessary to 
5 insert a fragment covering the 3' end. As with RNA 1 (SEQ ID No: 39), two versions 
were made. One, called pSR2A, used the 3' end as present in phr236, together with the 
poly(A) tail present in this version. The other pSR2B, used a PCR fragment carrying a 
BamHI site immediately downstream of the 3' nucleotide, as in pSRl(E3)B above. To 
construct pSR2A, a 350 bp Notl-Clal fragment was excised from phr236 and cloned 
10 into pSR2P70 cut with the same endonucleases. Linearization at the unique Clal site 
allows in vitro transcription of the complete RNA 2 (SEQ ID No: 47) and a poly(A) 
tail of about 50 nucleotides in length. 

To make pSR2B, an appropriate PCR product was made using as primers an 
15 oligonucleotide corresponding to nucleotide sequence 1 178 to 1 194 and to the 3' 
terminal 18 nucleotides of RNA 2 (SEQ ID No: 47). The latter primer carried a 
BamHII site attached, giving it the sequence: 
HvR23p: 5'-GGGGGATCCGATGGTATCCCGAGGGACGC 
(SEQ ID No: 14) 

20 

The template used was a plasmid phr236. Upon termination of the PCR reaction, the 
product was blunt-ended, kinased and gel-purified as described above, before cleavage 
with Not! The resulting 400 bp subfragment covering the required region was cloned 
into plasmid pSR2P70 cut with Clal, end-filled with Klenow and cut with NotI, giving 
25 plasmid pSRP2B. Linearization at the unique BamHI site allows in vitro transcription 
of the complete RNA 2 (SEQ ID No: 47), terminating with the sequence ACCaggatc. 

x) Construction of pSXR2P70 

This plasmid was made to determine where p24 starts. A 2. 1 kbp XhoI-BamHI 
30 fragment was cut from clone pSR2P70 and ligated into the vector pGrEM-1 (Promega) 
which had been cut with Sail and BamHI. In vitro transcription of the resulting 



51 



plasmid after linearization at the unique BamHI site yielded an RNA covering about 
70 nucleotides upstream of the first ATG at nucleotides 283 to 286, plus a short 
sequence derived from the vector. 

5 In vitro translation of the RNA from pSXR2P70 yielded both proteins (P70 (SEQ ID 
No: 52) + P24). 

xi) Description of virus-induced pathology 

The virus induces a rapid anti-feeding effect in Helicoverpa larvae as determined by 
10 experiments with larvae the results of which are shown in Fig. 3. Fig. 3 shows: A. 

neonate larvae (less than 24 h old) were fed the designated concentrations of isolated 
virus (in particles per ml [of diet] added to solid diet). They were weighed on 
following days and the mean of a statistically significant number (14) of larvae shown. 
Where necessary, mortality was recorded for the higher concentrations. The vertical 
1 5 axis shows the fold-increase in weight from the hatching weight of 0. 1 mg per larvae. 
This scale therefore also corresponds to weight in units of 0. 1 mg (ie 300 is equivalent 
to 30 mg). B. As for A, but the larvae were 5 days old at the start of the virus feeding. 
The vertical scale is in mg weight. 

20 No weight gain at all was detectable with neonates which had been fed the doses of 
virus over 10 8 particles per ml (virus added to diet). In addition, 100% mortality was 
evident after four days at the highest doses. Virus doses as low as 10 6 particles per ml 
(virus added to diet) still cause significant stunting. The five day old larvae showed a 
cessation of feeding after 48 hours and significant stunting at 4 dpi, but no mortality at 

25 comparable virus doses (Figure 3). Neonates are therefore very sensitive indeed to this 
virus. Virus particles accumulate specifically in the midgut. This potent anti-feeding 
effect may be due to the capsid protein or another protein encoded by the virus, or to 
the effect of any combination of such proteins. 

30 xii) Expression of virus-encoded proteins in bacteria. 
The vectors 
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The expression system used initially was derived from the pET-1 1 system (Novagen). 
Trimmed down versions of pET-1 lb and c were constructed and used to compare 
expression of the capsid proteins. However, due to difficulties experienced with this 
system substantial modification of the original vectors was carried out in order to 
5 achieve much higher yields. These results are described in xiii-b) below. 

The initial trimmed-down vectors discussed above were made as follows: pGEM-2 
(Promega) which carries T7 promoter adjacent to a poly-linker sequence, but has no 
sequences corresponding to the lac operon, was cut at the unique Xbal (34) aiid Seal 

10 (1651) sites, giving fragments of 1.61 and 1.25 kbp. The plasmids pET-1 lb and c 

were cut with the same enzymes, giving fragments of 4.77 and 0.91 kbp. The 1.61 kbp 
fragment of pGEM-2, carrying the c-terminal portion of the ampicillin-resistatice gene, 
the origin of replication and the T7 promoter, was then ligated to the 0.9 { kbp 
fragment of the pET vector, which carries a sequence covering the Shine-Dalgarno 

15 sequence, the ATG (in a Ndel site), the terminator for the T7 polymerase and the N- 
terminal portion of the ampicillin-resistance gene. The resulting plasmids of 
approximately 2.53 kbp, called pT7T2-b and c, therefore carry a complete T7 
transcription unit, which may be used as an expression system in a manner similar to 
the original pET-1 1 plasmids, but are repressor-neutral within the cell; they neither 

20 titrate away repressor by carrying a binding site, nor do they carry the gene producing . 
the repressor. They were found to grow very well in E.coli strains JM109 and BL21 
(DE3), and to be very efficient expression vectors. The repressor present in the cells 
was found to be sufficient to keep the genomic T7 polymerase gene uninduced and 
therefore the foreign gene unexpressed in the absence of IPTG. 

25 

xiii-a) Construction of plasmids for expression of capsid proteins 

In this section, all proteins expressed from segments of HaSV RNA 2 (SEQ ID No: 47) 
are referred to by the size of their gene, as defined in Fig. 4 and in section vi) of this 
example. The following plasmids were constructed by PCR, using the 
30 abovementioned full-length clone of RNA 2 (SEQ ID No: 47), plasmid pSR2A as the 
template, except where mentioned otherwise. 
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Groups of plasmids expressed protein starting at each of the first three methionine 
initiation codons found on the sequence of HaSV RNA 2 (SEQ ID No: 47). For those 
proteins initiating at the first methionine initiation codon found on the sequence of 
HaSV RNA 2 (SEQ ID No: 47) (which initiates the P 17 (SEQ ID No: 48) gene; 
oligonucleotide primer HVPET65N (SEQ ID No: 15)), an extra group of plasmids was 
made by PCR using as a template the version of the RNA 2 sequence carrying an extra 
C residue inserted at residue 570 (SEQ ID No: 51) (as depicted in Figure 2). 
Expression constructs initiating at the third methionine initiation codon found on the 
sequence of HaSV RNA 2 (which is located within the P 17 gene; oligonucleotide 
primer HVPET63N (SEQ ID No: 16)) were made by PCR using as a template only the 
version of the RNA 2 sequence carrying an extra C residue inserted at residue 570 
(SEQ ID No: 51). For these latter expression constructs, as well as those designed to 
initiate expression from the second methionine initiation codon found on the sequence 
of HaSV RNA 2 (SEQ ID No: 47) (which initiates the P71 gene; oligonucleotide 
primer HVPET64N (SEQ ID No: 17)), two versions were constructed. 

One version terminated at a point corresponding to the c-terminus of the processed 
(P64) form of the capsid protein and was made using oligonucleotide primer HVP65C 
(SEQ ID No: 19). The other version terminated at a point corresponding to the c- 
terminus of the precursor (P71 (SEQ ID No: 50)) form of the capsid protein and was 
made using oligonucleotide primer HVP6C2 (SEQ ID No: 20). 

The sequence encoding P64 (or the precursor, P71 (SEQ ID No: 50)) was synthesised 
in two segments using PCR. The amino-terminal half of the gene was obtained using 
as primers oligonucleotides incorporating one of the three ATG possible initiation 
codons for the ORF, in addition to an oligonucleotide with the sequence 
TCAGCAGGTGGCATAGG (SEQ ID No: 27); complementary to nucleotides 1653 to 
1669 of the sequence shown in Fig. 2. The forward primers were as follows: 
HVPET65N: 

AAA TA*i TTTTGTTT a pttt a n A a aa A a AT AT A C AT ATGAGCGAGCGAGCAC 
AC (SEQ ID No: 15) 
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(the underlined sequence corresponds to nucleotides 283 to 296 of the sequence shown 
in Figure 2) 

HVPET63N 

5 A A AT A ATTTTGTTT A ACCTT/I AGA AGGAGATCTACAT ATGCTGGAGTGGCG 
TCAC (SEQ ID No: 16) 

(the underlined sequence corresponds to nucleotides 373 to 390 of the sequence shown 
in Figure 2; the Affll (CTTAAG) and Bglll (AGATCT) sites introduced into the 
sequence by single nucleotide changes (shown in italics) in the oligonucleotide are 
10 shown in bold). 
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HVPET64N 

r*c± aca ti^t A r A T A TGGG AG ATGCTGGAGT G (SEQ ID No: 17) 
(the underlined sequence corresponds to nucleotides 366 to 383 of the sequence shown 
in Figure 2; the Bglll site introduced into the sequence by a single nucleotide change 
5 in the oligonucleotide is shown in bold). 

The PCR products obtained from each combination of one of these primers with the 
abovementioned one were treated with the Klenow fragment of E.coli DNA 
polymerase, and then with T4 polynucleotide kinase in the presence of 1 mM ATP, 

10 before purification by agarose gel electrophoresis as described above. Each product 

was then cleaved with Aatll to yield fragments of 0.95 and 0.4 kbp, arid each resulting 
fragment of about .95 kbp cloned intro vector pGEM-2 (Promega) cut with Hindi and 
Aatll, giving plasmids pGEMP63N (in which the insert commenced with 
oligonucleotide HVPET63N (SEQ ID No: 16)), pGEMP64N (in which the insert 

1 5 commenced with oligonucleotide HVPET64N (SEQ ID No: 17)) and pGemP65N (in 

which the insert commenced with oligonucleotide HVPET65N (SEQ ID No: 15)). The 
fragment covering portion of the HaSV capsid gene was then excised with enzymes 
Aatll and XbaL 

20 Two versions of plasmid pGemP65N were made, using different templates as . 

described above. pGemP65N was derived from the sequence of the viral RNA, as in 
plasmid pSF2A; plasmid pGemP65Nc was derived from the sequence carrying an 
extra C residue, as shown in Fig. 2 (see M 5C version"). 



25 
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In parallel, the carboxy-terminal halves of the major capsid protein variant, whether 
terminating as for P64 or for P71 (SEQ ID No: 50), were also produced using PCR. 
An oligonucleotide primer, HVRNA2F3, with the sequence 

GTAGCGAACGTCGAGAA (SEQ ID No; 18) (corresponding to nucleotides 873 to 
5 889 of the sequence shown in Figure 2) was used in conjunction with each of the two 
primers following: 

HVP65C 

GGGGGATCCTC AGTTGTCAGTGGCGGGGTAG (SEQ ID No: 19) 
10 (the underlined sequence is complementary to nucleotides 2072 to 2091 of the 
sequence shown in Figure 2). 

HVP6C2 

GGGGATCC CTAATTGGCACGAGCGGCGC (SEQ ID No: 20) 
15 (the underlined sequence is complementary to nucleotides 2290 to 2309 of the 
sequence shown in Figure 2). 

The PCR products obtained from each combination of one of these primers with the 
above mentioned one (HvRNA2F3 (SEQ ID No: 18)) were treated with the Klenow 

20 fragment of E.coli DNA polymerase, and then with T4 polynucleotide kinase in the 

presence of 1 raM ATP, before purification by agarose gel electrophoresis as described 
above. Each product was then cleaved with Aatll to yield fragments of 0.9 kbp (in the 
case of HVP65C (SEQ ID No: 19)) or LI kbp (in the case of HVP6C2 (SEQ ID No: 
20)) and 0.4 kbp, and each resulting fragment of about .9 or 1.1 kbp cloned into 

25 plasmid phr236 cut with Hindlll, treated with Klenow and Aatll, giving plasmids 

phr236P65C and phr236P70 (which has already been described above), respectively. 
The fragment covering the c-terminus of the capsid protein gene was then excised with 
enzymes Aatll and BamHI. 

30 To assemble plasmids for expression in suitable strains of E. coli P the excised Xbal- 
Aatll fragments of 0.95 kbp covering the amino-terminal half of the gene and the 
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excised Aatll - BamHl fragments of 0.9 or 1.1 kbp covering the carboxy-terminal half 
of the gene were simultaneously ligated into the vector pT7T2 cut with Xbal and 
BamHI. Initial transformation was of £ coli strain JM109. Recombinant plasmids 
carrying the correct insert were then transformed into strain BL21(DE3) for expression 
as described above. 

The plasmid obtained by ligating the aminoterminal fragment commencing with 
oligonucleotide primer HVPET63N (SEQ ID No: 16) to the c-terminal fragment 
ending at oligonucleotide primer HVP65C (SEQ ID No: 19) in the epxression vector 
pT7T2b was called pP65G. 

In the case of plasmid pP64N, containing an insert from HVPET64N (SEQ ID No: 17) 
to HVP65C (SEQ ID No: 19), the fragment covering the amino-terminal half of the 
oligonucleotide was excised by Bglll and Seal from the plasmid pGemP64N and the 
fragment covering the remainder of the gene was excised with Seal and EcoRI from 
plasmid pT7T2-P65. These two fragments were then ligated simultaneously into 
pP65G which had been cut with Bglll sand EcoRI. 

The resulting construct carrying the complete P71 (SEQ ID No: 50) precursor gene 
was called pT7T2-P71 and that carrying the P64 form of the gen was called pT7T2- 
P64. In the case of plasmids derived from pGemP65N and pGemP65Nc, carrying 
inserts commencing as defined by primer HVPET65N, the expression plasmid derived 
from pGemP65N which is based on PCR products made using as the template the 
sequence of the viral RNA, as in plasmid pSR2A, was called pTPl 7; a truncated form 
of this plasmid, which expresses P17 (SEQ ID No: 48), was made by cutting at the 
unique Bglll and BamHI sites, removing the intervening fragment (which corresponds 
to the c-terminal part of the insert) and religating the compatible cohesive ends, to give 
pTP17delBB. The expression plasmids derived from plasmid pGemP65Nc (which 
was derived from the sequence carrying an extra C residue, were called pT7T2-P65 
(carrying an insert terminating at the primer HVP65C (SEQ ID No: 19)) and pT7T2- 
P70 (carrying an insert terminating at the primer HVP6C2 (SEQ ID No: 20)). 
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Expression of P6 

Two forms of this protein, which arises through processing of the large capsid protein 
variant precursor P70 (SEQ ID No: 52) and therefore lacks its own initiation codon, 
were made. One form (protein MA) replaced the phenylalanine at the start of this 
5 protein with methionine, giving it the amino-terminal sequence MAA.. .; the other 
carries an additional methionine residue, giving it the amino-terminal sequence 
MFAA... The oligonucleotides used for PCR-amplified products covering the p6 
coding sequence carried a Ndel site (bold) at the ATG codon, for direct ligation into 
the pET- II vectors. The primers used were: 

10 

HVP6MA: AATTACATATGGCGGCCGCCGTTTCTGCC (SEQ ID No: 21) 
HVP6MF: AATTACATATGTTCGCGGCCGCCGTTTCT (SEQ ID No: 22) 

15 Each of these primers was used in conjunction with primer HVP6C2 (SEQ ID No: 20) 
to generate a PCR product of 0.2 kbp. These products were blunt-end ligated into 
vector pBSSK(-) which had been cut with EcoRV and dephosphorylated. The insert 
corresponding to the p6 gene was excised with Ndel and BamHI (using the BamHI site 
in the primer HVP6C2 (SEQ ID No: 20)) and ligated into the expression vector pET- 

20 lib, which had been cut with the same enzymes. For expression at higher levels, the 
insert was transferred to PT7T2 as a Xbal - BamHI fragment, yielding plasmids 
pTP6MA and pTP6MF. 

IPTG induction of bacteria containing plasmids pTP6MA or pTP6MF were used 
25 produce p6 for bioassay. 

xiii-b) Expression of viral genes in E, coli and bioassay in larvae 
Expression of P64 

IPTG induction of bacteria containing plasmid pT7T2-P65, which contains an insert 
30 running from the location of primer HVPET65N (SEQ ID No: 15) to that of primer 
HVP65C (SEQ ID No: 19), yielded-a protein of molecular weight 68 000. This was 3 
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000 molecular weight greater than the size of the authentic coat protein, as expected. 
Expression of pP65G, which contains an insert running from HVPET63N (SEQ ID 
No: 16) to HVP65C (SEQ ID No: 19), yielded a protein of 65 000 molecular weight. 

5 The authentic capsid protein (P64) was expressed poorly from plasmid pT7T2-P64. 
Recloning this insert as a Ndel-BamHI fragment back into the other form of the vector 
(PT7T2b) did not alter this. 

Expression of P70 

10 IPTG induction of bacteria containing plasmid pT7T2-P70, which contains an insert 
running from the location of primer HVPET65N (SEQ ID No: 15) to that of primer 
HVP6C2 (SEQ ID No: 20), yielded a protein of molecular weight 73 000. This was 3 
000 molecular weight larger than the size of the precursor of the toat protein, as 
expected. 

15 

The authentic capsid protein precursor (P71 (SEQ ID No: 50)) was expressed poorly 
from plasmid pT7T2-P71 . Recloning this insert as a Ndel-BamHI fragment back into 
the other form of the vector (pT7T2b) did not alter this. 

20 Due to the observation mentioned in vi) above, plasmids designed to express all forms 
of the capsid proteins from several possible ATG's at the start of the open reading 
frame were constructed. 

It was found that both authentic P64 and P71 (SEQ ID No: 50) were expressed poorly 
25 in bacteria. In contrast, P17 (SEQ ID No: 48) and the forms of the capsid protein 

commencing at the P 17 ATG were expressed very well. The extra C residue present in 
the latter two constructs resulted in a fusion protein being made from these expression 
plasmid. The sequence of the fusion proteins can be derived from Fig. 2 by including 
an extra C at position 570. The fusion caused the first 67 residues of the HaSV capsid 
30 protein to be replaced by the first 95 residues of P17 (SEQ ID No: 48). Good 

expression of the large capsid precursor and protein was achieved, but the size of these 
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proteins were above 3 kDa larger than the authentic forms. Notwithstanding this the 
expression products of the vectors containing the 5C variant of RNA 2 (SEQ ID No: 
51) are still useful because the resulting product, a P70 (SEQ ID No: 52) variant, is 
only modified at the NH 2 terminus. Since this terminus is thought to be embedded in 
5 the capsid structure and therefore not to participate in the initial interaction with the 
larval midgut cell, the variant is still useful. 

In order to produce constructs which ensure that the expressed proteins possessed the 
native amino terminus, new plasmids carrying the correct sequence were then cloned 
10 into the expression vector (pT7T2). It was found these plasmids to express proteins of 
the correct size. 

The P6 has not yet been to expressed from the new constructs. No evidence has been 
found for processing of P70 to yield the mature proteins in bacteria, nor upon in vitro 
1 5 translation of synthetic full-length RNA 2 (SEQ ID No: 47). 

The P 17 (SEQ ID No: 48) gene has also been cloned into the same vectors for 
expression and bio-assay. This protein accumulates well in bacteria upon induction, 
and electron microscopy analysis has shown it form spectacular honeycomb-like 

20 structures under the bacterial cell wall, completely surrounding the cell interior (results 
not shown). The properties of this protein including its amino acid composition and 
ability to form tube-like structures when expressed in bacteria suggest that it may be 
an homolog of a gap junction protein. The latter is involved in forming the channels 
linking the cytoplasms of adjacent epithelial cells in the insect gut. P 17 could then 

25 play a role in enlarging or forming these channels, thereby enabling cell-to-cell 

movement of the virus in the insect gut, analogous to the movement or spreading 
proteins encoded by plant RNA viruses. 

In order to ensure that the expressed proteins carried the native amino terminus the 
30 correct sequence has also been cloned into the expression vector (pT7T2). The vector 
had been very slightly modified to that described above to introduce two novel 
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restriction sited (for Aflll and BgUI) flanking the Shine-Dalgarno sequence. The 
resulting constructs have been found to be poor producers of the capsid proteins. The 
complete coding regions (which have been completely checked by re- sequencing) 
have therefore been recloned into the more satisfactory vectors. Results using these 
5 constructs suggest that the amino-terminus of the capsid protein presents inherent 
difficulties in expression. These difficulties may be imposed by either the nucleotide 
sequence encoding the amino terminus, or the actual amino acid sequence itself. To 
discriminate between these possibilities, two types of mutants were made in the 
sequence encoding the amino terminal 5 residues of the HaSV capsid protein. These 
10 amino-terminal mutants are as follows: 



HVP71GLY 

CCCATATG GGC GAT GCC GGC GTC GCG TCA CAG (SEQ ID No: 28) 
Met Gly Asp Ala Gly Val Ala Ser Gin (SEQ ID No: 29) 

15 

HVP71SER: 

CCCATATG AGC GAG GCC GGC GTC GCG TCA CAG (SEQ ID No: 30) 
Met Ser Glu Ala Gly Val Ala Ser Gin (SEQ ID No: 3 1) 

20 Native HaSV seq: 

ATG GGA GAT GCT GGA GTG GCG TCA CAG (SEQ ID No 32) 
Met Gly Asp Ala Gly Val Ala Ser Gin (SEQ ID No: 33) 



25 EXAMPLE 4 

EXPRESSION m BACULOVIRUS VECTORS AND BIOASSAY ON LARVAE 

Materials and Methods 

A(i) Cloning of HaSV capsid protein gene. 

30 The capsid protein gene was amplified by PCR using the following primers: 
5 ! primers : 
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HV17V71: 

5' GGGGGATCCCGCGGATTTATGAGCGAG (SEQ ID No: 34) 
HV17E71: 

5' GGGGGATCCCGCGGAGACATGAGCGAGCACAC (SEQ ID No: 35) 
5 HVP71: 

5' GGGGGATCCAGCGACATGAGAGATGCTGGAGTGG 

(SEQ ID No: 36) 

HW71: 

5' GGGGGATCCAGCGACATGAGAGATGCTGGAGTGG 
10 (SEQ ID No: 37) 

The ATG triplets initiating P17 (SEQ ID No: 48) (in HV17V71 (SEQ ID No: 34) and 
HV17E71 (SEQ ID No: 35)) or P71 (SEQ ID No: 50) (in HVP71 and HVV71) are 
underlined) 

15 3' primers : 

Primers HVP65C (SEQ ID No: 19) and HVP6C2 (SEQ ID No: 20), described in 
Example 3. Results section Xiiia, were used. These constructs were made using one 
of the four 5' primers and HVP6C2 (SEQ ID No: 20). Plasmids constructed from PCR 
products made using one of the four 5'- primers and HVP65C (SEQ ID No: 19) are 
20 called 17V64 (made using 5' primer 17E71 (SEQ ID No: 35)), P64 (made using 5' 

primer P71 (SEQ ID No: 36)) and V64 (made using 5' primer V71 (SEQ ID No: 37)). 
These plasmids allow expression of P64. 

25 A(ii) Cloning a full length cDNA of HaSV RNA 1 (SEQ ID No: 39). 

For expression of an RNA transcript corresponding to full length HaSV RNA 1 (SEQ 
ID No: 39), in insect cells by baculovirus infection or plasmid transfection, PCR was 
used to generate a fragment of cDNA linking the 5' end of RNA 1 (SEQ ED No: 39) to 
a Bam HI site. 
30 The primers were: 
HVR1B5 1 
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5' GGGGGATCCGTTCTGCCTCCCCGGAC (SEQ ID No: 38) 
(where the underlined nucleotide represents the start of natural RNA 1 (SEQ ID No : 
39)), and an oligonucleotide complementary to nucleotides 1192=1212 of RNA 1 
(SEQ ID No: 39). 

The template was plasmid pSRl(E3)B described in Example 3 above. 

A segment of the 1240 bp PCR fragment corresponding to the 5' 320 nucleotides of 
RNA 1 (SEQ ID No: 39) was excised with Bam HI and ASC II and cloned into the 
Bam HI site of pBSSK(-)[Stratagene] together with the 5 kbp ASCII - Bam HI' 
fragment of pSRl(E3)B, giving plasmid pBHVRIB, which carries the complete cDNA 
to HaSV RNA 1 (SEQ ID No: 39), flanked by Bam HI sites. 

A(iii) Cloning a full length CDNA of HaSV RNA 2 (SEQ ID fro: 47) 
For expression of an RNA transcript corresponding to full length RNA 2 (SEQ ID No: 
47) in insect cells by baculovirus infection or plasmid transfection, plasmid pB+NR2B 
was made by inserting a fragment carrying Hind III and Bam HI sites from the 
multiple cloning site of vector pBSSK(-) [Stratagene] into plasmid pSR2B described 
above. The resulting plasmid, called pBHVR2B, carried the cDNA corresponding to 
full length HaSV RNA 2 (SEQ ID No: 47), flanked by Bam HI sites. 
A(iv) Baculovirus transfer plasmids. 

Bam HI fragments of 5.3 and 2.5 kbp corresponding to HaSV RNAs 1 and 2 (SEQ ID 
Nos: 39 and 47) respectively, were excised from pBHVRIB and pBHVR2B 
respectively and inserted into the baculovirus transfer vectors described below, which 
had been linearised with Bam HI. 

B. Baculovirus Expression of Proteins. 

Baculovirus transfer vectors and engineered AcMNPV virus were transfected into 
Spodopterafrugiperda (SF9) cells as described by the supplier (Clontech) and as 
described in the following references: 
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Vlak, J.M. & Kens, RJ.A. (1990) in 'Viral Vaccines", Wiley-Liss Inc., NY, pp. 92-128; 
Kitts, P.A. et al (1990) Nucleic Acids Research 18: 5667-5672; Kitts, P.A. and Possee, 
R.P. (in preparation); Possee, RD. (1986) Virus Research, 5: 43-59. 

C. Western Blotting. 
5 As in Example 1 

D. Oligonucleotides. 

The following Ribozyme Oligonucleotides were produced according to standard 
methods. 
10 HVRlCla 

5 1 CCATCGATGCCGGACTGGTATCCCAGGGGG (SEQ ID No: 5) 
5' HVR2Cla 

5 1 CCATCGATGCCGGACTGGTATCCCGAGGGAC (SEQ ID No: 6) 

15 

RZHDV1 

5' CCATCGATGATCCAGCCTCCTCGCGGCGCCGGATGGGCA (SEQ ID No: 7) 
RZHDV2 

20 5' GCTCTAGATCCATTCGCCATCCGAAGATGCCCATCCGGC (SEQ ID No: 8) 



RZHC1 

5' CCATCGATTTATGCCGAGAAGGTAACCAGAGAAACACAC (SEQ ID No: 9) 

25 

RZHC2 

5' GCTCTAGACCAGGTAATATACCACAACGTGTGTTTCTCT (SEQ ID No: 10) 
Results 

30 A series of recombinant baculoviruses has been constructed, based o n the pVL94 1 

transfer vector (PharMingen) or pBakPak8 (Clontech) and the AcMNPV. These are 
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designed to express the correct forms of the precursor and processed HaSV capsid 
proteins (P64 and P71 (SEQ ED No: 50)) as well as the smaller capsid protein P6, and 
P17 (SEQ ID No: 48). In all systems where replicatable RNA encoding the nucleotide 
sequences of the present invention are to be used, such as eukaryotic systems, in order 
to get efficient replication, translation or encapsidation of the RNA it is necessary to 
excise structures downstream of the t-RNA like structure such as the 3' extension or 
poly A tail on the RNA. In order to carry out such an excision, ribozymes or other 
suitable mechanisms may be employed. This self cleavage activity of the ribozyme 
containing transcript should proceed at such a rate that most of the transcript is' 
transported into the cytoplasm of the cell before the regeneration of a replicatable 3' 
end occurs. Such ribozyme systems are more fully explained in Examples 7 and 9. In 
the results presented here highly efficient production of P64 and P71 (SEQ ID No: 50) 
has been achieved. Electron microscopy and density gradient analysis have confirmed 
that empty particles ("capsoids") are being produced in infected cells that efficiently 
express the P71 precursor gene. P17 (SEQ ID No: 48) placed in the context of the 
H. virescens juvenile hormone esterase (JHE) gene (Hanzlik T.N., et al, J. Biol. Chem. 
264, 12419-25 (1989)) is produced, but not in large amounts. The latter construct 
results in a reduction of expression of the capsid protein from the same recombinant, 
presumably due to a reduction in the number of ribosomes reaching the AUG for the 
capsid gene. 

SF9 cells infected with recombinant baculovirus have been shown to contain large 
amounts of icosahedral virus particles by electron microscopy (data not shown). These 
particles contained no RNA, and were empty inside. This observation shows that 
signals on the viral RNA required for encapsidation of RNA must be located in either 
the 5' 270 nucleotides or the 3' 170 nucleotides, or both, since these sequences were 
missing from the RNA transcripts made using recombinant baculovirus. Expression of 
HaSV proteins was confirmed by Western blotting of total protein extracts from 
infected insect cells. 
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In addition, the pAcUW3 1 vector (Clontech), which carries two promoters, is being 
used to simultaneously express p6 and p64 as separate proteins. 

In order to bioassay the capsid protein produced in baculovirus infected cells, it is first 
necessary to purify it from the baculovirus expression vector. Preliminary attempts 
5 have made use of density gradients, based on the observation that empty virus particles 
("assembled capsids") are in fact produced in infected cells. 

As outlined earlier, the HaSV genome or portion thereof is a particularly effective 
insecticidal agent for insertion into baculovirus vectors. Such a vector is constructed 

10 by insertion of the complete virus genome or portion thereof (preferably the replicase 
gene) into the baculovirus genome as shown in Fig. 13. Preferably the virus genome 
or replicase is transcribed from a promoter active constitutively in insect cells or active 
at early stages upon baculovirus infection. An example of such a promoter is the heat 
shock promoter described in Example 7. Heat shock promoters are also activated in 

15 stressed cells, for example cells stressed by baculovirus infection. An even more 

preferable use of such a baculovirus construct is to use the HSP promoter to drive the 
HaSV replicase and another gene for a toxin (as exemplified elsewhere in the 
specification) where the RNA expressing the toxin gene is capable of being replicated 
by the HaSV replicase. Such recombinant bacuioviruses carrying the HaSV genome 

20 or portions thereof for expression in larvae at early or other stages of the baculovirus 
infection cycle are particularly effective biological insecticides, 

EXAMPLE 5 

25 EFFECT OF HaSV GENES AND THEIR PRODUCTS ON PLANTS 

Materials and Methods 
A. Electroporation of protoplasts. 

Protoplasts of Nicotiana tobacum, K plumbaginifolia and Triticum aesticum and oats 
were produced and electroporated with either HaSV or HaSV RNA as described in 
30 Matsunaga et al (1992) J.Gen. Virol 73: 763-766. 
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B. Northern blot analysis - RNA extraction from protoplasts after harvest 

The protoplasts are subjected to 3 cycles of freezing and thawing, and then an equal 
volume of 2x extraction buffer (100 mM Tris-HCl ? pH 7.5, 25 mM EDTA, 1% SDS, 
made in DEPC treated water) is added, followed by 1 volume of phenol (equilibrated 
5 in 10 mM Tris-HCl pH 8.0) heated to 65 °C. The samples are mixed by vortexing and 
incubated at 65 °C for 15 min, vortexing every 5 min. After phase separation by 
centrifugation at room temperature for 5 min, the aqueous phase is re-extracted with 
phenol, re separated by centrifugation and re-extracted with chloroform/isoamyl 
alcohol. To the aqueous phase are then added 0.1 volume of DEPC-treated sodium 
10 acetate (pH 5.0) and 2 volumes of ethanol. The RNA is recovered by precipitation at - 
70°C, followed by centrifugation at 4°C for 15 min. The samples were then analysed 
by agarose gel electrophoresis as described in example 1. 

After blotting to Zeta-Probe membrane (BioRad), the hybridization protocols were as 
1 5 above for Example 2. 
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C. Total protein from HaSV - electroporated protoplasts. 

Protoplasts were analysed by SDS-polyacrylamide gel electrophoresis and Western 
blotting as described in Example 1 . 

5 Results 

i) Use of complete (replication-competent) RNA virus genome in protoplasts 

a) HaSV replication in protoplasts 

The nodavirus FHV has previously been shown to replicate in barley protoplasts 
(Selling H.H., Allison, R. F. and Kaesberg, P. Proc. Natl Acad. Sci. USA 87,434-8 

10 (1990). To determine whether HaSV virus RNA can replicate in plants protoplasts, 
when introduced by electr op oration, experiments using protoplasts from Nicotiana 
plumbaginifoli and wheat have been conducted. (These are all species for which 
protoplasts are regularly available in the Division of Plant industry). Assays for 
replication including RNA (Northern) blots using probes derived from cloned 

15 fragments of cDNA to RNAs 1 and 2 (SEQ ID Nos: 39 and 47), and Western blots, 
using the antiserum to purified HaSV particles. Initial experiments showed that both 
HaSV virus and RNA electroporated into protoplasts of N. plumbaginifolia resulted in 
HaSV replication as studied using and verified by northern blots and ELIS A. As a 
positive control TMV RNA was electroporated and was replication observed. 

20 

b) Bioassays 

Protoplasts into which HaSV RNA had been introduced by electrop oration were 
harvested after 6 or 7 days post electroporation and used in bioassays on neonate 
larvae by addition to normal diet. The results showed significant stunting of test larvae 
25 in comparison to control larvae (see Table 1 below). Protoplasts lacking HaSV RNAs 
had no effect on the larvae, confirming the result of control experiments. This result 
confirms that HaSV RNA, when expressed or replicated in plant cells, is able to cause 
the formation of infectious virus particles able to control insect larvae feeding on the 
plant material. 

30 
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Northern blotting has been used to confirm that RNA electroporation into protoplasts 
leads to RNA replication. 

Table 1: Results of Bioassay from a typical experiment with Nicotiana and oat 
5 protoplasts (oat results are shown in brackets) [see over] 





Treatment 


Number 


Escapes 


Number 










stunted 


1. 


diet only 


12(12) 


2(3) 


0/10(0/9) 


2. 


diet+protoplasts 


12 (12) 


0(1) 


0/12 (0/11) 


3. 


HaSV+diet 


12(12) 


0(1) 


12/12(11/11) 


4. 


diet+HaSV/protoplasts 


12 (n.d.) 


0 (n.d.) 


12/12 (n.d.) 


5. 


diet+RNA/protoplasts 


12 (12) 


0(0) 


11/12 (10712) 



HaSv replication in the larvae was confirmed except for two larvae 
1 5 which were dead. The letters "n.d." mean the experiment was not doae. 

The above results demonstrate assembly of HaSV particles from electr op orated RNA 
in protoplasts of both moncot and dicot plant species. 

20 c) Plasmids to test replication of cloned and engineered forms of HaSV 

(1) Plasmids allowing in vitro transcription of HaSV RNAs 1 and 2 (SEQ ID Nos: 39 
and 47) for electroporation into protoplasts have already been described above. 

(2) Plasmids for transient expression of individual HaSV RNAs (1 or 2) (SEQ ID Nos: 
39 and 47) in protoplasts. Full-length cDNAs for the two viral RNAs have been 

25 inserted into expression plasmids pDH51 (with the CaMV 35 S promoter. Pietrzak 

M., et al (9186) Nucl Acids Res. 14, 5857-68) for dicots and pActlxas (with the rice 
actin promoter) for monocots (McElroy et al (1990) The Plant Cell 2: 163-171). As 
with the vectors for expression in insect cells, these expression plasmids are being 
modified to include a cis-acting ribozyme for generation of authentic ends. The non- 
30 ribozyme plasmids gave no virus replication. 

ii) Expression of capsid protein in plants 
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In view of the present inventors' observation that empty particles ("assembled 
capsids") are being produced in baculovirus-infected cells that efficiently express the 
P71 precursor gene, expression of the coding region for the capsid protein in tobacco 
plants was investigated. The vector chosen for this purpose is based on pDH5 1 which 
carries the CaMV 35 S promoter and polyadenylation signal If necessary for improved 
expression, this vector can be modified by the addition of a translation enhancer 
sequence from e.g. TMV. Although certain groups have constructed transgenic plants 
expressing the capsid proteins of plant viruses, there has been only one recent report of 
assembly of empty capsids in such plants (Bertioli et al,(1991) J. gen. Virol. 72: 1801- 
9). Bertioli et al point out that the protein-protein interactions in most icosohedral plant 
RNA viruses may be too weak to allow assembly of such capsids. In addition to the 
present inventors' observation of empty HaSV capsids, it has been found these capsids 
are very tough, showing great resilience to e.g. repeated cycles of freezing and 
thawing, so that it is expected to see assembly of empty HaSV capsids ("assembled 
capsids") in transgenic plants. 

Construction of capsid protein expression plasmid. 

Vector used was pDH5 1 ; linearised with BamHI and phosphatased. , 
Insert was PCR product made using following 2 primers: 
CAPPLANT: 

5' G GGGATCC ACA ATG GGA GAT GCT GGA GTC -3' 
(BamHI) 

(i.e. A BamHI site followed by plant consensus context for ATG of capsid protein 
gene and 15 further nucleotides of this gene - nts 366-383 of HaSV RNA2). 
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HVP6C2 (Example 3) 

The PCR product was made with VENT polymer (New England Biolabs). After gel 
purification, it was cut with BamHI and cloned into the vector. Orientation screened 
with EcoRI to identify insert in same direction as promoter giving plasmid 
5 pDHVCAPB. Expression was verified by Western blotting using anti-HaSV 

antiserum. Both precursor P71 and processed P64 capsid protein were detected in 
protoplasts following transfection with pDHVCAPB, showing assembly of virus-like 
particles. 

10 EXAMPLE 6 

IDENTIFICATION OF MIDGUT BINDING DOMAINS 
Materials & Methods 

A. Plasmid construction 

Was as described in Examples 3 and 4. 

15 

B. Western blotting 

Was as described in Examples 1 and 3. 

C. Invitro translation 

20 In vitro transcripts of cloned CDNA of HaSV RNA's was translated in vitro as in 
Examples 1 and 3 . 

D. Preparation of Brush Border Membrane Vesicles. 

Brush Border Membrane Vesicles were prepared from freshly isolated larvae midguts 
25 of HArmigera by the method of MWolfersberger et al (1987) Comp. Biochem. 

Physiol 86A: 301-308, as modified by S.F.Garczyuski et.al (1991) Applied Environ. 
Micro-biol 57: 1816-2820. Brush Border Membrane Vesicles binding assays using 
invitro labelled protein or 125 I-labelled protein were as described in Garczynski et.al 
(1991) or in H.M.Horton and Burand, J.P. (1993) J.Virol 67: 1860- L868. 
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Results 

i) Determination of epitopes on the capsid surface 

Comparison of the recently published sequence of the Nudaurelia a> virus (MvV) 
capsid protein with that of HaSV shown that these proteins are closely related and fall 
into four distinct domains, which are alternatively variable and highly conserved. 
These domains are summarised as follows: 



Residues: 



% identity: 



HaSV 
MgV 



1-49 
1-46 

37 



50-272 
47-269 
81 



273-435 
270-430 
34 



437-647 
431t645 
81 



Comparison of this observation with the alignment by Agrawal and Johnson (1992) 
between the JVwV and the nodavirus BBV (whose crystal structure is known: Hosur et 
al (1987) Proteins: Structure, Function & Genetics 2: 167-176) showed that the 
variable region coincided with a region forming the most prominent surface protrusion 
on the BBV capsid. Both HaSV and TVwV carry large insertions at this point relative 
to BBV, and these insertions are largely different in sequence. Assuming that the 
alignment by Agrawal and Johnson (1992) is correct, then this means that HaSV and 
NwV have a more prominent pyramid-like structures as a surface protrusion than do 
the nodaviruses, and the pyramid-like structures are different. As already noted, there 
is no immunological cross-reactivity between the two viruses, despite the high degree 
of identity. There is thus a strong implication of the variable domain as a surface 
protrusion which functions as the sole antigenic region. 

To confirm this a 400 bp Narl fragment spanning the variable region was deleted from 
the capsid gene in the expression vector. With end-filling of these sites the deletion is 
in-frame, so that a truncated protein of ca. 57 KDa is produced in bacteria upon 
induction. This protein was recognized only poorly on Western blots by the antiserum 
against intact HaSV particles made in rabbits. The central variable domain was 
recognized well by the antiserum when expressed in isolation from the rest of the 
capsid gene. 
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As shown in the table above the region of HaSV capsid protein comprising residues 
273-439 shows great divergence form the corresponding region of the NwV capsid 
protein, compared to its immediate flanking regions. Within this region an especially 
divergent domain is found from residue 35 1 to residue 411, which shows only 25% 
5 identity to the corresponding region of the NwV capsid protein. This region is 

flanked by the sequences corresponding to the b-sheet structural features b-E(residues 
339-349) and b-F(residues 424-431) of the HaSV capsid protein, based on the 
alignment the NwV and nodavirus capsid proteins by Agrawal and Johnson (1992), 
and is therefore likely to form the loop of the most prominent surface protrusion on the ' 
10 HaSV capsid. This is based on comparison and correspondence to the nodavirus 

capsid protein structure and capsid structure as described by Wery J.-P. and Johnson, 
2 IE. (1989) Analytical Chemistry 61, 1341A-1350A and Kaesberg, P., et al. (1990) J. 
J^J Mol. Biol. 214, 423-435. This loop is thought to contain important epitopes. It is 
y significant that this exterior loop on the nodavirus capsid protein is one of the most 
y|5 variable regions when capsid proteins sequences from a number of nodaviruses are 
Ly compared (Kaesberg et al. 1990). 

Finally, the present inventors have observed a significant level of immunological 
ly cross-reaction on Western blots, between antisera against the CrylA(c) Bt toxin and 
QO HaSV capsid protein, whether obtained from virus or expressed in bacteria. Initial 

data from the Narl deletion mutant described above suggest that this binding is not to 
the central variable domain, but to other regions of the capsid protein. The only other 
region of the proteins which shows extensive sequence variability, the amino terminus, 
cannot be responsible for the binding, since both authentic capsid protein and the 
25 protein with an altered amino terminus expressed in bacteria are recognized by the anti 
Bt antisera. 

ii) In- Vitro binding assays 

30 The full-length clones for in vitro translation yielding highly 35 S or 3 H labelled proteins 
were constructed by replacing the bacterial translation interaction signal in the T7 
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plasmids above by the more active eucaryotic context sequence from the JHE gene. 
The labelled capsid protein made by in vitro translation of the in vitro transcripts may 
be tested for binding to brush border membrane vesicles (BBMV's). Conditions are 
optimised by testing different procedures. The deletion mutant lacking approximately 
5 125 amino acids in the central region, and containing the variable domain, as well as 
others derived from it are also tested. 



iii) Fusion proteins comprising virus capsid midgut binding domains and 
other proteins 

10 The idea behind these tests is to fiise the binding domain from the HaSV capsid protein 
to either large proteins (preferably indigestible, causing protein to aggregate in or on 
the midgut cells) or toxin domains from other proteins with suitable properties but 
normally different binding specificities (e.g. Bt). In initial experiments, the gene for 
the complete capsid protein has been fused to the GUS gene, as has a deletion mutant 

1 5 containing essentially only the central portion of the capsid gene. The resulting fusion 
proteins are being expressed in bacteria and tested for GUS activity, and makes them 
sensitive probes for binding experiments on midgut tissue. 



iv) Mapping binding sites using Bt/HaSV fusion proteins 

20 Analysis of deletion mutants of the CrylA(c) Bt toxin has identified domains which 
may be involved in determining the host-specificity of this Bt by acting as receptor- 
binding sites (Schnepf et al (1990) J. Biol. Chem. 265: 20923-20930; Li et al (1991), 
Nature 353: 815-21. The present inventors have obtained a clone of this toxin gene. 
Deletion mutants corresponding to those identified by Schnepf et al are constructed. 

25 Segments of the HaSV capsid protein gene can then be inserted into these mutants, the 
protein expressed in bacteria and their insecticidal function assayed. 



EXAMPLE 7 
VIRAL GROWTH IN CELL CULTURE 
30 Materials & Methods 
A. Cell Lines 
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The following cultured insect cell lines were tested for infection by HaS V: 
Drosophila melanogaster, Helicoverpa armigera (ovarian derived), Heliothis zea 
(ovarian derived), Plutella xylostella, Spodoptera frigiperda (SF9). 
All lines were grown under standard conditions. Upon reaching confluence, the 
culture medium was removed and all mono-layers covered with 1.5 ml of cell culture 
medium into which HaSV had been diluted; the average multiplicity of infection 
(M.Oi.) was 10 4 After adsorption at 26°C for 2h, the inoculum was removed, the 
cells carefully washed twice with phosphate buffered saline (pH 7.0) and incubation 
continued with 5 ml of 10%. Foetal calf serum in TC199 culture medium (Cyto' 
Systems). 

B. Northern Blotting Analysis. 

Virus replication in all the above cell lines was confirmed by northern blotting 
analysis. Total RNA was extracted from infected cells by the method of Chomczynski 
and Sacchi (1987). Anal. Biochem. 162: 156-159. The cells were lysed in 1 ml of lysis 
solution (4M guanidinium thiocyanate, 25mM sodium citrate, pH 7, 0.5% sarcosyl, 
0.1M 2-mercaptoethanol). In order, 0.1 ml of 2M sodium acetate, pH 4, 1 ml of 
phenol (0.2M sodium acetate equilibrated), and 0.2 ml of chloroform-isoamyl alcohol 
mixture (49: 1) were added with thorough mixing between reagents. This was then 
vortexed for 10 s and cooled on ice for 15 min. Tubes were centrifuged in an 
Eppendorf centrifuge at 14k for 15 min at 4°C for at least 15 min to allow RNA 
precipitation. RNA was pelleted by centrifugation at 14k for 15 min, washed with 0.6 
ml of ice-cold 70% ethanol, pelleted once again (10K, 10 min), air dried at room 
temperature and resuspended in DEPC (Sigma) treated millipore water. RNA was 
subject to denaturing agarose gel electrophoresis in the presence of formaldehyde 
according to Sambrook et.al. (1989). The gel was Northern transferred to a zeta-probe 
membrane (Biorad) as described by Sambrook et.al (1989). The probe was prepared 
by random-priming the 3' sequences of the HaSV genome using DNA and cDNA 
clones pSHVR15GB and pT7T2p71SR-l as per manufacturer's instructions 
(Boehringer-Mannheim). Hybridization was carried out as described for the standard 
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DNA probe protocol contained within the literature for the zeta-probe membrane 
(Biorad). 

C. Vectors 

5 Vectors as described below. 

Results 

It has been found that HaSV will replicate in several continuous cell lines, of which 
the best is the Spodopterafrugiperda line SF9. Time course assays by Northerfi 

10 blotting in SF9 cells have shown that RNA 1 (SEQ ID No: 39) replication is clearly 

detectable within a few hours of infection. RNA 2 (SEQ ID No; 47) is present only in 
very small amounts early in infection and accumulates much more slowly than RNA 1 
(SEQ ID No: 39) does. This observation is consistent with one made earlier in HaSV- 
infected larvae, where RNA 2 (SEQ ID No: 47) replication was not observed until 3 

1 5 days after infection. 

Some apparent replication was also observed in Drosophila cells (DL2), but with the 
difference that more RNA 2 (SEQ ID No: 47) replication was observed at the early 
time points compared to the lepidopteran cell lines above. 

20 

Plasmids that express the HaSV genome as RNA transcripts from full length cDNA 
clones have been constructed and tested. These clones, constructed by PCR and 
carefully checked, have restriction sited immediately adjacent to the ends of the 
sequence. Transcription is driven from a specially-re-engineered Drosophila HSP70 
25 promoter. 
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i) Constructs for expression in insect cells 

The constructs are based on vectors carrying the Drosophila HSP 70 or actin promoters 
and suitable polyadenylation signals from Drosphila (Corces & Pellicer (1984) J. Biol. 
Chera 259: 14812-14817) or SV40 (Angelichio et al (1991) Nucl. Acids. Res. 18: 
5037-5043). Since transcription from such plasmids generates viral RNAs carrying 
long 3' terminal extensions derived from sequences in the poladenylation signal 
fragment, it is necessary to achieve cleavage of the transcript immediately after the 
3'sequence of the viral RNA. These plasmids gave no virus replication, presumably 
because of the 3' terminal extension. The method of choice for obtaining authentic 3' 
termini is based on introduction of DNA sequences encoding a cis-acting ribozyme 
into the constructs. With suitable engineering, such a ribozyme will cleave 
immediately 3' to the viral sequences within the transcript. Suitable ribozymes, based 
on the hepatitis delta virus (Been M.D., Perrotta, A. T. & Rosenstein, S.P. 
Biochemistry 3 1, 1 1843-52 (1992) or the hairpin cassette ribozyme (Altschuler, M, 
Tritz R. & Hampel, A. Gene 122, 85-90 (1992) have been designed (see Example 4). 
This involves synthesis of overlapping oligonucleotides, which are then annealed and 
end-filled with the Klenow fragment of DNA polymerase, to create short DNA 
fragments encoding the desired ribozyme. These fragments carry restriction sites at 
their termini allowing them to be ligated into plasmids between the viral RNA cDNA 
(which has a 3' restriction site added by PCR) and the restriction fragment carrying the 
poladenylation signal. Ribozyme function has been verified (Example 9). 

The Drosophila HSP70 promoter was joined to the HaSV RNA 1 sequence as follows. 
A BamHI restriction site was introduced into the promoter sequence as described on 
p. 5 of this specification. Oligonucleotide HVR1B5P described in Example 8 was used 
to prime PCR of RNA 1 to yield a cDNA copy of the RNA carrying a BamHI 
restriction site 5' to the RNA 1 sequence and separated from it by the nucleotides ACA 
which end the HSP70 promoter just before the start of transcription. This common 
BamHI site was used to link the HSP70 promoter and the HaSV RNA 1 sequence. 
The resulting plasmid was completed by adding either the hairpin cassette ribozyme 
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(giving plasmid pHSPRlHC) or the HDV ribozyme (giving plasmid pHSPRlHDV) 
plus the SV40 late polyadenylation sequence. 

A similar approach was used to obtain plasmids for RNA 2 i.e. pHSPR2HC and 
5 pHSPR2HDV. 

An alternative approach is to link the promoter and the HaS V cDNAs using blunt end 
ligation of a DNA fragment and carrying the promoter and terminating at the last 
nucleotide before the start of transcription (the underlined residue in ACA) and the 
10 cDNA fragments corresponding to either HASV RNA 1 or 2, as described for the plant 
expression plasmids in Example 9. 

The latter approach was used to join the sarcoma virus (RSV) long terminal repeat 
(LTR) promoter to the HaSV cDNAs for expression in insect cells. The RSV LTR 

15 promoter is active in many animal cells (Cullen, B.R. Raymond, K. & Ju, G. (1985) 
MoL Cell. Biol. 5,438-447) and also in lepidopteran cell lines (D. Miller personal 
communication). It was obtained from plasmid pRSVCAT (Gorman, C, 
Padmanabhan, R. & Howard, B.H., (1983) Science 221, 551-553) as a 495 bp 
fragment carrying a 5-XbaI site (added by PCR) and terminating at a blunt end with 

20 the sequence AAC, with the underlined residue corresponding to that immediately 
before the start of transcription. The resulting plasmids, pRSVRlHCLA and 
pRSVR2HCLA, carry the HaSV RNA 1 and 2 cDNAs, respectively, and are otherwise 
like pHSPRlHC and pHSPR2HC, respectively. These plasmids carry the SV40 late 
polyadenylation signal. They allow efficient and precise expression of the HaSV 

25 genomic RNAs in insect cells, for example if introduced using a baculovirus vector or 
by transfection. 
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EXAMPLE 8 
SHEDDING OF INFECTED CELLS 

Materials & Methods 
5 A. Confocal Laser Scanning Microscopy. (CLSM) 

CLSM enables the visualisation and analysis of three-dimensional cell and tissue 
structures at the macro and molecular levels. The Leica CLSM used in this example is 
based on an MC 68020/6888 1 VME bus (20MHz) with standard 2Mbyte framestore 
and 4Mbyte RAM and OS9 operating system with programmes written in C code. It 

10 incorporates a Leica Diaplan research microscope and using XI 0/0.45, 

X25/0.75,X40/1.30 and X63/1.30 Fluotar objectives has a claimed optical efficiency 
better than 90%. The confocal pinhole is software controlled over the range of 20 to 
200 mm. Excitation at 488 and 5 14 nm is provided by a 2 to 50 mW argon-ion laser. 
B, Immunocytochemistry (ICC). 

1 5 For whole mount ICC, tissues were dissected under saline and fixed in fresh 4% 

formaldehyde in phosphate buffered saline (PBS) for at least 15 mins. After multiple 
washes in PBS they were permeablized either by 60 mins incubation in PBT (PLBS 
with 0.1% Triton X-100 plus 0.2% bovine serum albumin). After 30 mins blocking in 
PBT+N (5% normal goat serum) tissue was incubated in primary antibody diluted 

20 (1 :40) in PBT+N for at least 2 hrs at room temperature then at 4°C overnight. After 
extensive washing in PBT and 30 mins blocking in PBT+N the FITC conjugated 
secondary antibody diluted (1 :60) in PBT+N was incubated for 2 hrs at room 
temperature plus overnight at 4°C. After multiple washes in PBT and PBS the tissue 
was cleared in 70% glycerol and mounted in 0.01%w/v p-phenylenediamine 

25 (Sigma#P 1519) dissolved in 70% glycerol All processing was at room temperature 
unless otherwise stated. 
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Results 

The inventors' current model for the effect of HaSV involves the detection by the 
insect midgut of infected cells, their identification as infected and their subsequent 
shedding in numbers sufficient to cause irreparable damage to the insect midgut. The 
evidence for this is based on the above and on the following direct observation of the 
fate of infected cells in midgut tissue over 1-3 days post infection. These results in 
repeat experiments were complicated by the discovery that another unrelated virus was 
present in the larval population being tested. Preliminary findings indicated that HaSV 
infection activates or facilitates pathogenesis of the unrelated virus and together these 
cause severe disruption of the larval gut cells. Thus these two agents appear to act 
synergistically in causing gut cell disruption. 

Midguts from larvae infected with HaSV were treated with the antiserum to purified 
HaSV particles (above) and examined under the Laser confocal microscope (described 
above). This established that some midgut cells were sufficiently infected with HaSV 
to give strong fluorescence signals. Such cells were moreover clearly separating from 
the surrounding tissue, a sign that they were in the process of being shed. 

Similar observation have been made with other insect viruses (Flipsen et al (1992) 
Society for Invertebrate Pathology Abstract #96) although in these cases the effect is 
too localised and weak to cause any anti-feeding effect apparently only the small RNA 
virus of the tetraviridae which are localised to the gut and cause more-or-less severe 
anti-feeding effects in their hosts (Moore, N.F. in Kurstak E. (Ed) (1991) Viruses of 
Invertebrates. Marcel Dekker, New York pp277-285) are capable of such an effect to 
an extent sufficient for pest control. 

Following on from the immune-fluorescence work, in situ hybridization can be carried 
out to detect RNA replication in infected cells.Furthermore, larvae infected with a 
recombinant HaSV expressing a foreign gene at early stages (by insertion of that gene 
into RNA 1 in place of the N-terminal portion of the replicase gene) can be studied. A 
correlation between virus replication and cell rejection can be confirmed by 
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histochemical analysis of the midgut cells of the infected larvae. Thus the cell- 
shedding phenomenon offers a direct and rapid assay for early events in HaSV- 
infected gut tissue. Extracts of baculo-vector infected insect cells carrying empty 
HaSV particles can be fed to larvae directly and the midgut examined by toluidine blue 
staining and immune-fluorescence at intervals after infection. This will allow direct 
determination of whether the particles can bind the brush border membranes in intact 
gut, and whether such binding can induce the massive disruption evident in normally 
infected larvae. Control experiments using extracts from cells infected with the 
baculovector alone can be conducted to observe and distinguish effects due to the 
vector. The immune-fluorescence assay on midgut tissue allows analysis of binding to 
midgut brushborder membranes. Once determined for wild-type capsid protein 
expressed from a baculo-vector, deletion or replacement mutants can be inserted into 
the baculovectors. Suitable cell extracts from these can be used to* infect larvae. 



EXAMPLE 9 
ENGINEERED VIRUS AND USES 

Materials & Methods 

(as indicated in earlier Examples) 

i) Engineered virus as a vector for other toxin genes 

This involves placing suitable genes under control of HaSV replication and 
encapsidation signals. Genes which may be suitable include intracellular insect toxins 
such as ricin, neurotoxins, gelonin and diphtheria toxins. The toxin gene may be 
placed in the viral gene such that it is a silent (downstream) cistron on a polycistronic 
RNA, or in a minus strand orientation, requiring replication by the viral polymerase to 
be expressed. Standard techniques in molecular biology can be used to engineer these 
vectors. 

A discussion of two recombinant HaSV vectors which have been designed is given 
below: 
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for RNA 1 (SEQ ID No: 39): 

The reporter gene (or one of the toxin genes mentioned above) is inserted in place of 
the amino-terminal portion of the putative replicase gene, such that the intiation codon 
used for the replicase (ie that at nucleotides 37-39 of the sequence) is now used to 
commence reporter gene translation. The fusion is achieved by the use of artificial 
Ncol restriction sites common to both sequences. 

The short 36 nucleotide S'-untranslated leader of RNA 1 (SEQ ID No: 39) (shown in 
upper case) is synthesised as the following sequence: 

ggggatccacaGTTCTGCCTCCCCCGGACGGT AAAT AT AGGGGAA CC ATG ' 
Gtctagagg, (SEQ ID No: 53) 

using two overlapping oligonucleotides comprising the first 3 1 (oligonucleotide 
HVR1B5P) nucleotides and the complement of the last 40 nucleotides (oligonucleotide 
HVR1NCO) respectively. These primers are annealed and end-filled by Klenow. The 
resulting fragment is then cut with BamHI and Xbal (sites underlined) and cloned with 
plasmid vector pBSIISK(-) to give pBSSKRINCO. 

The GUS gene carrying a Ncol site at the ATG codon was obtained as a NcoI-SacI 
fragment from plasmid pRAJ275 (Jefferson, RAJ Plant Mol. Biol Rep 5, 3387-405 
(1987)). This Sad site is located just downstream from the coding sequence for the 
GUS gene. 

The 5' leader of HaSV RNA1 is excised as a BamHI-Ncol fragment from the plasmid 
pBSSKRINCO, and is ligated together with the NcoI-SacI fragment carrying the GUS 
gene into plasmid pHSPRlHC or pHSPRlHDV or pDHStuRlHC carrying the full- 
length cDNA insert of RNA 1 (see above) which has been cut with BamHI and Sad. 
The resulting plasmid then carries a complete form of RNA 1 (SEQ ID No: 39) but 
with the amino-terminal portion of the replicase gene substituted by the GUS gene. It 
is desirable to produce a construct with approximately the same size as RNA 1 (SEQ 
ID No: 39) for encapsidation purposes. 

Similar approaches are adopted for RNA 2 (SEQ ID No: 47), with the foreign, reporter 
or toxin gene fused to the initiation codon of either P17 or P71 . In either case the 
context sequence of the introduced gene is modified to give the necessary expression 
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level of that protein. The foreign gene is introduced into plasmids pHSPR2HC or 
pHSPR2HDV or pDHStuR2HC. 

The above recombinants have been described specifically as insertions of a reporter 
gene (GUS). The toxin genes to be inserted are described on page 14 of the 
specification. These preferably further require a signal peptide sequence added at the 
amino-terminus of the protein. 

ii) Capsid technology 

Identification of encapsidation (and replication) signals on virus RNA allows design of 
RNAs which can be encapsidated in HaSV particles during assembly of virus in a 
suitable production system. The virus capsids then carry the RNA of choice into the 
insects midgut cells where the RNA can perform its intended function. Examples of 
RNAs which may be encapsidated in this manner include RNAs for specific toxins 
such as intracellular toxins, such as ricin, gelonin, diptheria toxins or neurotoxins. 
This strategy is based on the resistance of the virus particle to the harsh gut 
environment. 

iii) Other uses of the capsid particle 

The capsid particles can be used as vectors for protein toxins. Knowledge of 
icosahedral particle structure elucidated by the inventors suggests that the amino and 
especially the C-termini are present within the capsid interior. It is possible to replace 
or modify the amino acid sequence corresponding to P7 such that it encodes a suitable 
protein toxin which is cleaved off the bulk of the capsid protein during capsid 
maturation. As with toxin-encoding mRNAs, the HaSV capsid delivers it to the 
midgut cell of the feeding insect, where it exerts the desired toxic effect. 

iv) Use of HaSV in plants 

The use of HaSV in the production of insect-resistant transgenic plants are 
shown in Fig. 12. These inventions are based on the use of either the complete HaSV 
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genome, or of the replicase gene as a tool for the amplification of suitable amplifiable 
mRNAs (e.g. encoding toxin) or of the capsid protein as a means to deliver insecticidal 
agents. These strategies are now described in some detail 

5 a) Use of the complete HaS V genome 

Fragments of cDNA corresponding to the full-length HaS V genome 
components RNAs 1 and 2 (SEQ ED Nos; 39 and 47) are placed in a suitable vector for 
plant transformation under the control of either a constitutive plant promoter (e.g. the 
CaMV 35S promoter mentioned above) or an inducible promoter or a tissue specific 
10 (e.g. leaf-specific) promoter. The cDNAs are followed by a cis-cleaving ribozyme and 
a suitable plant polyadenylation signal. Transcription and translation of these genes in 
transgenic plant tissues and cells leads to assembly of fully infectious virus particles to 
infect and kill feeding larvae. 

15 The following experiments were conducted. The plasmids for expression used the 

CaMV 35 S promoter to generate transcripts commencing at the first nucleotide of the 
HaSV RNAs 1 and 2 (SEQ ID Nos: 39 and 47). The vector pDH51 (M. Pietrzak, R. 
Shilito, T. Hohn and I. Potrykus (1986). Nucleic Acids Research 14, 5857) which 
carries the CaMV 35S promoter followed by a multiple cloning site and the CaMV 

20 polyadenylation fragment was modified to make a suitable vector, pDH5 1 Stu, carrying 
a StuI site at the immediate 3' end of the CaMV 35S promoter. The promoter thereby 
terminates in the sequence GAGAGGCCT, with the underlined residue being that at 
which transcription would start. (Similar vectors have been described by Mori et al. y J. 
General Virology 72, 243-246 (1991) and Dessens and Lomonossoff; ibid 74 ? 889-892 

25 (1993).) The StuI site (AGG/CCT) is followed by a BamHI site (GGATCC). 

Cleavage of this vector with StuI and BamHI generates a vector DNA molecule with 
one blunt end (from StuI cleavage) and one sticky BamHI end. This allows ligation of 
cDNA molecules corresponding to the full-length HaSV genomic RNAs, and carrying 
a blunt end at the 5' end of the full-length cDNA and a BamHI site after the 3 '-end of 

30 the full-length cDN A. 
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Suitable cDNA fragments carrying a blunt end corresponding to the 5 '-terminal 
nucleotide of either RNA 1 or 2 (SEQ ID Nos: 39 and 47) were generated using PCR 
and an oligonucleotide primer corresponding to the 5'-terminal first 18 nucleotides of 
the sequence of either RNA 1 (SEQ ID No: 39) or RNA 2 (SEQ ID No: 47). The 
5 cDNA sequence corresponding to the 3 f terminal sequences of either RNA 1 (SEQ ID 
No. 39) or RNA 2 (SEQ ID No 47) were followed on these DNA fragments by 
sequences corresponding to one of the ribozymes whose sequences are shown in Fig. 8 
and whose construction is described in Example 7. The 3 '-terminal sequence 
corresponding to an Xbal site (TCTAGA) shown in these ribozyme sequences was 
10 followed on the suitable DNA fragments by a BamHI site, which upon cleavage with 
this enzyme yielded a sticky end capable of being ligated into the BamHI end of the 
vector cleaved as described above. There were therefore a total of four suitable DNA 
fragments for insertion into the vector: 

1 5 RNA 1 (SEQ ID No: 3 9) followed by the hairpin cassette (HC) ribozyme 

RNA 1 (SEQ ID No: 39) followed by the hepatitis delta virus (HDV) ribozyme 
RNA 2 (SEQ ID No: 47) followed by the hairpin (HC) ribozyme 
RNA 2 (SEQ ID No: 47) followed by the hepatitis delta virus (HDV) ribozyme. 
These four fragments were individually ligated into the vector pDH5 IStu cleaved with 

20 StuI and BamHI to generate four distinct plasmids as follows: 
pDHStuRlHC 
pDHStuRlHDV 
pDHStuR2HC 
pDHStuR2HDV 

25 Transcription from the 35S promoter in these plasmids results in RNAs commencing 
at the first nucleotide of either the RNA 1 sequence (SEQ ID No: 39) or RNA 2 
sequence (SEQ ID No: 47) and terminating in the CaMV polyadenylation fragment. 
Self-cleavage at the locations shown in Fig. 8 by the cis-acting ribozymes obtained 
within these transcripts generates RNA molecules with the 3 '-termini corresponding to 

3 0 the natural virus termini . 
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After amplification and purification on CsCl gradients, thirty mg of each of these four 
plasmids was transfected by electroporation into aliquots of two million N. 
plumbaginifolia protoplasts (as described in Example 5) either individually or in the 
combinations listed below: 
5 pDHStuRlHC + pDHStuR2HC 
pDHStuRlHDV + pDHStuR2HDV 



The production of infectious HaS V particles within transfected protoplasts was then 
demonstrated by bioassay on heliothis larvae. After incubation at 25 °C for 3-5 days, 

10 the protoplasts were recovered by low speed centrifugation and applied directly to 

standard heliothis diet as surface contamination for bioassay as described in Example 
1 . Stunting was only observed when plasmids expressing HaSV RNA 1 (SEQ ID No: 
39) and RNA 2 (SEQ ID No: 47) were co-transfected ? and then only in the case of 
those carrying the hairpin ribozyme to generate the viral 3' ends (see Table 2). In 

15 contrast, constructs carrying the HDV ribozyme at the 3 f end were not infectious. The 
reasons for this have not been determined. As expected, expression of RNA 1 or 2 
(SEQ ID Nos: 39 and 47) alone in protoplasts did not lead to the assembly of 
infectious particles. Western blot analysis of protoplasts transfected with the RNA 2 
(SEQ ID No : 47) constructs did show production of limited amounts of the capsid 

20 protein. 
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Suitable control experiments confirmed that larval stunting was due to HaSV particles 
generated de novo in the protoplasts. As shown in the Table 2, neither the protoplasts 
alone nor protoplasts mixed with plasmid DNA were capable of initiating stunting. 

Table 2 



Treatment No. of Escapes No. 

larvae stunted 

1. diet alone *24 0 0 

30 2. diet + HaSV 24 0 24 

3. diet + protoplasts 24 0 0 



88 



4. diet + pDHStuRlHC 24 0 0 

5. diet + pDHStuRlHDV 24 0 0 

6. diet + pDHStuR2HC 24 0 0 

7. diet + pDHStuR2HDV 24 0 0 
5 8. diet + pDHStuRlHC + 24 0 22 

pDHStuR2HC* 

9. diet + pDHStuRlHDV 24 0 0 

+ pDHStuR2HDV* 

10. pDHStuRlHC + pDHStuR2HC 24 0 0 



1 0 (but mixed with protoplasts) 



* these plasmids were co-transfected with pDHVCAPB (see Example 5) 

m HaSV infection of stunted larvae was confirmed by dot-blotting of RNA using HaSV 
J§ specific probes. After weighing, larva were sacrificed and total RNA extracted as 
CP follows. Each larva was homogenised in the presence of 260 ml deionised water, 24 
ill ml 2M sodium acetate pH 4.0 and 200 ml phenol equilibrated with 2M sodium acetate 
f pH 4.0. After centrifugation at 14 000 rpm for 15 min at 4°C, the supernatant (about 
p 200 ml) was removed and extracted once with an equal volume of chloroform. After 
5*30 centrifugation, the supernatant (about 200 ml) was mixed with 20 ml of sodium acetate 
O and 400 ml of absolute ethanoL The precipitate after centrifugation was vacuum dried 
and redissolved in 5-10 ml of sterile, DEPC-treated water. For dot-blotting, the RNA 
was mixed with 70 ml of DEPC-treated water and 30 ml of 10 mM EDTA, 30 mM 
NaOH. HaSV RNA was determined and quantified by dot blotting (as described in 
25 Example 2) using a probe random primed DNA from clones corresponding to the 
terminal 1000 nucleotides of RNA 1 and 2. All larvae recorded as stunted in the 
bioassays were found to carry HaSV and give signals comparable to those of the larvae 
fed purified HaSV particles (Table 2). To confirm that the larvae were infected with 
HaSV, ten aliquots of protoplasts were electroporated with plasmids pDHStuRlHC + 
30 pDHStuR2HC and the protoplasts fed (after incubation) to 150 heliothis larvae. The 
larvae were allowed to grow for one week, upon which significant stunting was 
observed in 50% of the larvae, and virus was then purified from these stunted larvae as 
described in Example 1 . Analysis on CsCl gradients showed the production of distinct 
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the RNA can be encapsidated by HaSV capsid protein and/or replicated by the HaS V 
replicase in infected insect cells (see Figs. 12a and 12b) 

Transgenic plants would contain two different transgenes, making either 
5 unmodified capsid protein precursor or a modified form in which most of the 

carboxyterminal protein P7 is replaced by a suitable insect-specific toxin or one which 
is inactive as part of a fusion protein. (Gelonin or other ribosome-inactivating 
proteins, insect gut toxins or neurotoxins may be suitable here.) Expression from these 
two transgenes would be regulated so that only the required amounts of the modified 

10 and unmodified forms are made in the plant cell, and assembled in such proportions 

into the capsoids. One way to modulate the production of capsotixin fusion proteins is 
to make translation of the carboxyterminal toxin reading frame dependent on a 
translational frameshift or read-through of a termination codon. With an appropriate 
low frequency of frame-shifting (eg 0.1- 2%), it could even be sufficient to use a 

1 5 single transgene, if it were possible to synthesise the P7 portion and the toxin portion 
as overlapping genes. Upon assembly (which we have demonstrated in insect cells 
using the baculovirus vectors) and maturation, the protein precursors are cleaved and 
release the mature P7 and the toxin, which remain within the capsoids. These proteins 
are not released until capsoid disassembly occurs in insect gut cells. The processed 

20 form of the toxin is then able to kill the pest. 

(c) HaSV particles devoid of nucleic acid carrying one or more suitable protein 
toxins and/or their mRNA 

A protein toxin (or toxins) is expressed as a fusion with the capsid protein. The 
25 fusion protein then assembles into capsid carrying the toxin(s). These capsids present 
in the plant tissue exert an antifeeding effect on insects attaching the plant. 

EXAMPLE 10 

EXPRESSION OF HaSV IN OTHER DELIVERY VECTORS 
30 Materials & Methods 

(as indicated in earlier Examples) - 
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Constructs similar to those for plant expression are introduced into yeast or bacteria by 
standard techniques. Virus particles are assembled for either fully infectious virus or 
any of the modified or biologically contained forms described in Example 9. Microbes 
produced in suitable fermentation or culture facilities and carrying such forms of the 
5 virus are then delivered to the crop by spraying. The microbial cell wall provides extra 
protection for the virus particles produced within the microbe. 

Well established techniques exist for culture and transformation of yeast (Ausubel, 
F.M. et al (eds) Current Protocols in Molecular Biology. J. Wiley & Sons, NY, 1989). 
10 An example of a yeast expression vector is pBM272, which contains the URA3 

selectable marker (Johnston, M. & Davies, R.W. Mol. Cell. BIoL 4, 1440-8, (1984); 
Stone, D. & Craig, E. Mol. Cell. Biol. 10, 1622-32 (1990). Another example of an 
expression vector is pRJ28, carrying the Trpl and Leu2 selectable markers. 

15 Yeast has recently been shown to support replication of RNA replicons derived from a 
plant RNA virus, brome mosaic virus (Janda, M. & Ahlquist ; P. Cell 72, 961-70 
(1993). Since the BMV replicase is distantly related to that of HaSV, and the two 
viruses are likely to replicate by similar strategies within cells, yeast cells probably 
contain all the cellular factors required for HaSV to generate infectious virus. 

20 

For bacteria, suitable expression vectors have been described above. 

EXAMPLE 11 

The transvirus approach for insect pest control: Making transgenic plants 
25 expressing HaSV 

1. Vector construction 

A special binary vector was constructed for transforming plants with the HaSV 
genome. This vector is based on pART27 (A. Gleave (1992) Plant Mol.Biol.20, 1203- 
1207), which was modified to (1) carry an alternative origin of replication for the host 
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Agrobacierium tumefaciens and (2) incorporate restriction sites in the multiple cloning 
site for restriction enzymes Asc I and Pac I which recognise rare (8bp) sequences. 



For engineering the multiple cloning site, pART27 was cut with Spel and Notl. Ten 
5 picomoles of each of the two oligos whose sequence follows (TOP and BOTTOM) 
were annealed in 10 microlitres of water (heated to 80°C for 2 min and allowed to cool 
slowly to room temperature). The sticky ends on these annealed oligonucleotides 
allowed the insert to be cloned into pART27 (giving pART27mod) as described in 
Example No. 3 and 9. 
10 Sequence of oligonucleotide: ' 

TOP : 5 '-GGCCGCTTAATTAAGGATCCGGCGCGCCA-3 ' 

BOTTOM: 3-CGAATTAATTCCTAGGCCGCGCGGTGATC-5 

(The Pad recognition sequence is TTAATTAA and that for AscI is GGCGCGCC). A 
15 4kbp Sail fragment from plasmid pART27mod (containing the right border, IacZ 

marker (-^multiple cloning site)nptll gene for kanamycin resistance under control of 
the nos promoter and polyadenylation signal and the left border) was cloned into the 
13kbp vector pKT231 linearised with Xhol. Plasmid pKT23 1 carries the IncQ origin 
of replication for the host Agrobacierium tumefaciens and a resistance (marker) gene 
20 for streptomycin/spectinomycin. (Bagdasarian, M. & Timmis, K.N. (1982) Curr. 
Topics Microbiol. Immunol 96, 46-67). These two fragments were ligated using 
standard protocols (eg in Example No 3) and transformed into E.coli strain DH5a 
using standard protocols (eg in Example No 3). The resultant plasmid was named 
pJDMLl. 
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2* Cloning HaSV genes into transfer plasmid 

Construction of transfer vectors with HaSV genes: 

5 Before the HaSV gene cassettes could be cloned into binary transfer vectors pART27 
mod or pJDMLl, they were re-cloned into the vector plasmid pBJ33 to provide 
flanking AscI and Pad sites. Plasmid pBJ33 (provided by Bart Janssen) is based on 
pBC SK(+) supplied by Stratagene), but with a multiple cloning site modified to 
contain the following sites: 
10 SacI/PacI/AscI/SacII/Xbal/Spel/BamHI/Pstl/EcoRI/EcoRV/Hindn 
al/PacI/AscI/KpnL 

The cDNA fragment corresponding to complete HaSV RNA 1 behind the 35S 
promoter and terminating in the hairpin cassette ribozyme and the CaMV 

1 5 polyadenylation signal fragment (approx 6 kpb in total) was excised from plasmid 

pDHStuRlHC (Example 9) with EcoRI and cloned into EcoRI-cut vector pBJ33 to 
give plasmid pBJ33RlHC. Similarly, the cDNA fragment corresponding to complete 
HaSV RNA 2 behind the 35 S promoter and terminating in the hairpun cassette 
ribozyme and the CaMV polyadenylation signal fragment (approx 3.3 kbp in total) was 

20 excised from plasmid pDHStuR2HC (Example 9) as two fragments, one (covering the 
35S promoter and the first 500 bp of the RNA 2 sequence) of about lkbp with EcoRI 
and R5rII and the second (covering the remainder of the RNA 2 sequence, the 
ribozyme and the polyadenylation signal) of about 2.3kbp with RerlT and Hindlll. 
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These two fragments were simultaneously ligated into EcoRI and Hindlll-cut vector 
pBJ33 to give plasmid pBJ33R2HC. 

A 1.9 kbp fragment comprising the 5' 1.7 kbp of the HaSV capsid gene, 
together with the polyadenylation fragment, were excised from expression plasmid 
5 pDHVCAPB (described in Example 5) as a Eco RI - Kpnl fragment and cloned into 
pTZ19U (pharmacia) cut with EcoRI and Kpnl, giving pTZ19UEVCAPB., This 
portion of the HaSV capsid gene expression cassette was then re-excised as a Hindlll- 
EcoRI fragment and cloned into PBJ33 cut with these enzymes. This plasmid 
(pBJ33EVCAPB) was then linearized with EcoRI and the ca. 800 bp EcoRI fragment 
10 from pDHVCAPB carrying the 35S promoter and the 5' 250 bp of the capsid gene 
inserted, followed by screening for orientation. The resulting plasmid carrying the 
reassembled complete capsid gene expression cassette was named pBJ33VCAPB. 

Assembling binary plasmids: 

15 

The RNA 1 expression cassette was excised from plasmid pBJ33RlHC with AscI and 
Pad and cloned into pART27 mod cut with AscI and Pad to give pMLRL The RNA 
2 expression cassette was also cloned as an AscI-PacI fragment into pJDMLl cut with 
AscI and Pad to give pJDMLR2. 

20 

The capsid protein gene cassette was excised from pBJ33 VCAPB with Pad and 
cloned into plasmid pMLRl cut with Pad. Resulting plasmids were screened for 
orientation and the plasmid with the capsid gene and RNA1 in the same orientation 
was named pMLRl V. The complete fragment carrying the HaSV capsid gene and 
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RNA 1 expression cassettes in pMLRl V was excised with AscI and cloned into 
pJDMLR2 linearised with AscI to give pHaSVl (29kpb). This plasmid carries the 
HaSV capsid gene expression cassette and the HaSV RNA 1 and RNA 2 expression 
cassettes in this order and all in the same orientation. The kanamycin resistance gene 
5 is located upstream of the capsid gene and in the opposite orientation. 

Table of constructs generated: 



Vector 


Insert(s) 


Name 


#Plants 


Comments 








(independent 










transformants) 




pART27mod 


RNA 1 


pMLRl 


15 


control 


pJDMLl 


Rl + R2 +CAP 


pHaSVl 


30 


complete virus 


pART27mod 


Rl + CAP 


pMLRlV 


15 


subvirus 


pJDMLl 


Rl +CAP 


pJDMLRlV 


30 


subvirus 


pART27mod 


RNA2 


pMLR2 


15 


control 


pJDMLl 


RNA2 


pJDMLR2 


15 


control • 


pART27mod 


CAP 


pMLVF 


15 


control 



(CAP = HaSV capsid gene) 

20 3. Plant transformation and regeneration 

Binary transfer vectors (above) were transfopned into Agrobacterium tumefaciens 
strain LBA4404 by electrop oration (Lin, JJ. (1994) FOCUS 16,18-19; Lin, JJ. (1994) 
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Plant Science 101, 11-15). Leaf discs from Nicotiana tabacum grown under sterile 
conditions were transformed using co cultivation with transformed A. tumefaciens 
(Horsch, R.B. etal (1984) Science 23, 496-498; Horsch, RB. etal (1988) Plant 
Molecular Biology Manual A5: 1-9; as modified by Lisa Molvig (pers. comm.)) and 
5 grown on kanamycin. After transfer of regenerating shoots for further selection on 
kanamycin medium, kanamycin-resistant roots were selected and then tissue from 
these plants used to verify HaSV gene expression. The numbers of plants selected are 
shown in the table above for each of the constructs. 

10 4. Western, Northern and Southern blotting on transgenic plants 

For western blots: A small amount (0. lg) of fresh leaf material from each plant was 
extracted by grinding in 0.2 ml of plant extraction buffer (0.2M NaCl, 0. 1M Tes, pH 
7.65, ImM PMSF, 2% b-mercaptoethanol, ImM EDTA). After centrifiigation to 
15 pellet plant debris the supernatant was collected and lOjal aliquots run on a SDS-gel 

for blotting and immuno-analysis with antibody against HaSV as described in Example 
1 . The results for the first plants assayed are given in Table 3 . 

For Northern blots; Total leaf RNA was extracted from 0. 1 5 g of fresh leaf material. 
20 The leaf material was ground under liquid nitrogen to a powder and then extracted by 
forther grinding in 0.45 ml NTES buffer (0. 1 M NaCl, lOmM Tris-HCl pH 8.0, ImM 
EDTA, 0.1% SDS) plus 0.45 ml Tris pH8.0-saturated phenol/chloroform. The slurry 
was vortexed, centrifuged for 3 min and the aqueous phase mixed with 1 volume of 
isopropanol to precipitate RNA andT)NA. After resuspending the pellet in 0. 1 ml 
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water, 1 volume of 4M LiCl was added and the mix stood on ice overnight before 
centrifugation to pellet RNA. The RNA was then analysed by gel electrophoresis 
according to the methods in Examples 1 and 2. HaSV specific RNAs were detected by 
Northern blottings as described in Example 2 and by using riboprobes made to detect 
5 the 3 '-terminal 1000 nucleotides of each of RNA 1 and 2, made using the Promega 
Riboprobe kit and used as specified by the supplier. 

For Southern blots: to detect HaSV genes in plant genomic DNA. 

10 To recover plant genomic DNA, the supernatant from LiCl precipitation (above) was 
mixed with 2 volumes of ethanol. The pellet was redissolved and the DNA cut with 
BamHI before analysis on agarose gels and transfer to nylon membrane as described 
by Sambrook et al (1989) and by the manufacturer (Zetaprobe/BioRad). HaSV- 
specific bands were detected described above. 

15 

5. Bioassays on leaf material 

Two small leaves (2-3 cm in length) were selected from each transformed 
plant selected, and placed in petri dishes containing 1.5% agarose in water. Three to 8 
20 neonate larvae were placed in each petri dish and observed for 3 days. At the end of 
this time, larvae were weighed and then total RNA extracted as described in Example 
1. The extent of leaf damage was quantified by measuring the area of leaf consumed 
by each group of larvae over the three days of the assay (see Table 3). 
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Table 3 



Preliminary bioassay of HaSV transgenic plants 

Three to 8 larvae were placed on a small leaf (from a newly regenerated plant) in a 
petri dish with no provision of fresh food, after 3 days, larvae were sacrificed and 
northern blotted; also, protein extracted from leaves of the plants were western blotted 
using anti-HaSV antisera. 



Plant 


Transform- 
ation 
Plasmid 


Western Blot 
for HaSV 
capsid 
protein in 
plant (+/-) 


Northern 
blot for 
HaSV 
RNA in 
plant 


Larval 
Weight 
(mg) ' 


Leaf 

Damage 

(mm 2 

consumed 

/larvae) 


Negative 
Controls 








2.1- 

2.7+0.8* 


61 


1.1 

(subvirus) 


pJDMLRlV 


+ 




1.1+0.2 


29 


(RNAl=p 
71) 












3.2 (whole 
virus) 


pHaSVl 


+ 




1.0+0.4 


38 


3.4 (whole 
virus) 


pHaSVl 


+ 




1.2+0.4 


32 



* Diet was limiting (ran out of food) in some cases 



Table 4 

Further bioassay of HaSV transgenic plants 

Four - 6 individual larvae were fed leaf disc (50mm 2 ) from control or transgenic plants 
at one disc each per day for 4 days, before transferred to artificial diet for a further 3 
days. RNA was then extracted from the larvae and Northern blotting with HaSV- 
specific probes used to verify the presence of HaSV in the larvae. 
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Plant 



negative control 

positive control 
(leaf + HaSV) 
3.2 



Transformed with 



pHaSVl 



Western blot for Mean larval 

HaSV capsid protein weight (mg) 
in plants (+/-) 

12.4 
0.1 



+ 



0.9 



3.10 
3.11 



pHaSVl 
pHaSVl 



+ 



+ 



4.8 
8.2 



Efficacy of HaSV as atransvirus in plants 

Factors affecting the efficacy of HaSV are the viral dose required, the expression 
levels achieved in plants and the leaf damage observed. These need to be considered 
separately at this stage due to uncertainty about the efficiency of HaSV assembly in 
plants and because larvae will continue feeding for about one day after receiving a 
toxic dose of HaSV. 
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Dose of Virus 

Infection with HaSV requires neonate larvae to eat up to 10 000 particles. 
Assuming that transgenic plants make only 1 particle per cell, this means the 
larvae must consume up to 10 000 leaf cells. 

Since a small tobacco leaf contains about one million cells, larvae could 
acquire a toxic dose by consuming just 1% of the leaf 

This dose would correspond to as little as 0.000 000 5% of the soluble protein 
in these cells (330x 10' 13 g of HaSV per leaf in 7xl0" 13 g soluble plant protein 
per leaf). 

Expression levels , 

Assuming standard levels of 1% expression and complete incorporation into 
virus particles, there should be about 10 8 particles per cell (7xl0 _9 g of protein 
per cell over 330x1 0" 19 g per HaSV particle). 

However, at present only part of this protein is likely to form infectious virus. 
If 1% does, then there would be 10 6 particles per cell, well above the toxic 
dose. 

Initial results from Western blots suggest current expression at least exceed 
01.1% of soluble cell protein. Processing of the precursor protein appears to 
occur to a variable extent, suggesting that particle assembly has also occurred. 

The dose of infectious virus delivered by transgenic plants must be quantified 
by appropriately standardised bioassays. 

Optimisation of the infectious virus level will be achieved by improving virus 
assembly rather than just boosting expression of components - this represents 
a fundamental difference to the situation with toxins like Bt. 
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Dose of Virus 

Infection with HaSV requires neonate larvae to eat up to 10 000 particles. 
Assuming that transgenic plants make only 1 particle per cell, this means the 
larvae must consume up to 10 000 leaf cells. 

Since a small tobacco leaf contains about one million cells, larvae could 
acquire a toxic dose by consuming just 1% of the leaf. 

This dose would correspond to as little as 0.000 000 5% of the soluble protein 
in these cells (330x 10" 13 g of HaSV per leaf in 7xl0" 13 g soluble plant protein 
per leaf). 

Expression levels 

Assuming standard levels of 1% expression and complete incorporation into 
virus particles, there should be about 10 8 particles per cell (7xl0" 9 g of protein 
per cell over 330xl0" 19 g per HaSV particle). 

However, at present only part of this protein is likely to form infectious virus. 
If 1% does, then there would be 10 6 particles per cell, well above the toxic 
dose. 

Initial results from Western blots suggest current expression at least exceed 
011% of soluble cell protein. Processing of the precursor protein appears to 
occur to a variable extent, suggesting that particle assembly has also occurred. 

The dose of infectious virus delivered by transgenic plants must be quantified 
by appropriately standardised bioassays. 

Optimisation of the infectious virus level will be achieved by improving virus 
assembly rather than just boosting expression of components - this represents 
a fundamental difference to the situation with toxins like Bt. 
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Leaf damage 

While as little as 1% of the leaf (and more likely far less) may be sufficient to 
deliver a toxic dose of HaSV, larvae will keep feeding for a limited period 
5 after becoming infected. This makes it necessary to determine the extent of 

leaf damage empirically. 

Our initial observations were that plants making detectable levels of HaSV 
capsids showed reduced susceptibility to larval feeding; this has not been 
10 quantified yet, and the assay was a severe one. 

Consumption of leaf material by infected larvae may be estimated indirectly 
using our data on larval growth and frass production, which are approximately 
equal. Since neonate frass production is too low to quantify, the data were 
1 5 obtained from 4-day old larvae. These produce 30 mg of frass over 7 days, 

compared to 400 mg for uninfected controls. Neonate growth and frass 
production may be estimated at 10% of this figure. 

Assuming that 1 mg growth or frass - 3 mg leaf material, an infected neonate 
20 will consume about 5% of a small tobacco leaf (20 mg of a total fresh weight 

of 350 mg) over seven days compared to over 60% for an uninfected control 
(240mgof350 mg). 

Biosafety Considerations 

25 

It is believed that the approach of controlling pests by making an insect virus in 
transgenic plants is not dangerous to the environment. This is despite our very 
tentative observation that some HaSV replication is observed in protoplasts. There has 
been widespread debate recently concerning the safety of protecting crops against 
30 plant viruses by inserting transgenes expressing viral proteins into the plants. Falk, 
B.W. and Bruening, G., 1994 (Science 263, 1395-1396) identified 3 possible 
mechanisms which might result in the appearance of novel viruses. These mechanisms 
are transencapsidation, phenotypic mixing and heterologous recombination. 
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Flasmids 


Bioassays weights (mg) of 
larvae fed on protoplast 
extracts 


HaSV RNAs 1 & 2 
Q elected ijy r>uriiierii 
blotting of RNA extracted 

ft*nm lsifvap 

11 \J 111 1A1 V t*.\s 


1. P DHStuRlHC + 
pDHVCAPB 


29 ±15 


+ 


2. pDHStuRl HDV + 
pHVCAPB 


57 + 25 




3. Control: (diet only/diet 
+ protoplasts 


85 ±15 




4. pDHStuRl HC + 
pDHStuR2 HC + 
pDHVCAPB 


33 ±28 


+ 


5. pDHStuRl HDV + 
pDHStuR2 HDV + 
pDHVCAPB 


64 ±22 





ii) RNA extraction from larvae showed 

(a) that larvae fed protoplasts transfected with pDHStuRlHC + 
pDHStuR2HC + DHVCAPB contained both RNA1 and 2 of HaSV in 
intact form. 

(b) that larvae fed protoplasts transfected with pDHStuRlHC + DHV CAPB 
(subvirus) contained a very small amount of intact HaSV RNA1 and a 
considerably greater amount of degraded RNA1. 

(c) that larvae fed protoplasts transfected with pDHStuR 1 HDV + 
pDHStuR2HDV + pDHVCAPB contained no HaS VRNA with one 
exception. 

(d) that larvae fed protoplasts transfected with pDHStuRl HDV + 
pDHVCAPB contained no HaSV RNA. 

Conclusions: 

The HC (HaSV expression) constructs with the hairpin cassette ribozyme give 
infectious particles with both RNAs; the HDV expression constructs do not under 
these conditions. 

That the subvirus approach results in RNA1 replicating in larvae but this RNA is 
degraded because it cannot be encapsidated due to missing replicatable RNA2. 

That subvirus approach gives stunting as effectively as does the complete virus 
approach under these conditions. 
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EXAMPLE 12 
CAPSOVECTOR 

5 The aim of this section is to describe the invention of capsovectors and present 
supporting data. In addition ideas for their further improvement will be presented. 

The capsovector is a virus-like particle (VLP) from a small RNA insect virus with a 
pest insect host whose properties will facilitate the entry (or vector) into the cells of 
10 the pest insect either, or both, RNA or a protein that will induce toxicity in the insect. 
Capsovectors can be produced in transgenic crop plants targeted by the host insect 
pests or produced for spray applications by transgenic plants or recombinant 
microorganisms. 

1 5 There are two types of capsovectors, ones that vector RNA moieties (RNA 

capsovectors) and ones that vector proteinaceous moieties (protein capsovectors). 
Because of their distinct properties, RNA and protein capsovectors will be dealt with 
separately. However, the description of the protein capsovector will rely heavily on 
the preceding description of the RNA capsovector. 

20 

In this invention, the Helicoverpa armigera stunt virus (HaSV) of the Tetraviridae will 
be used as the model insect small RNA virus. However, this does not exclude other 
types of insect small RNA viruses being used in a similar manner. 

25 Characteristics of HaSV pertinent to capsovectors, 

HaSV is a member of the Tetraviridae family of viruses and infects only midgut cells 
of young larvae of heliothine insects (Hanzlik et al., 1993) after ingestion with food. 
Numerous attempts to grow the virus in non-gut tissue, other insects and cultured cells 
30 have failed. This believed to be due to the ability of the HaSV capsid protein to bind 
and enter only midgut cells of the host insect (Hanzlik et al, 1995). 

HaSV has been characterised in great detail at the molecular level. Its physical 
characteristics have been determined (Hanzlik et al, 1993) and its complete genome 
35 sequenced (Gordon, et al, 1995; Hanzlik, et al, 1995). 
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particles is aimed at replication of new virions replication with any toxicity is a by- 
product of this process; that contained within capsovectors is aimed at rapid toxicity to 
the cell and not at replicating new capsovectors. 

Important to the toxic activity of a RNA capsovector are the properties of viral genes 
that lead to the activities of encapsidating, protecting, entry, uncoating and translating 
of the viral RNA in the host cell. These activities are necessary for successful virus 
infection yet do not involve replication. Because of the simplicity of HaSV, all of 
these properties are contained in one gene and its product, the coat protein, p71. 
Generally, the coat protein gene of other small RNA viruses have the same capacity 
and therefore are also suitable for capsovectors. The p71 gene of HaSV is employed 
in illustrating the capsovector invention and its properties are explained and 
demonstrated under their respective headings below. 

The capsovector adds toxicity to the virus functionalities by using sequences 
exogenous to HaSV or viruses on the RNA contained within the capsovectors. These 
toxic sequences are aimed at inducing rapid and direct toxicity to the cell the 
capsovector has entered. The sequences are either translated into protein or fold the 
RNA into appropriate secondary structures which then causes a toxic lesion in the cell. 
These sequences are explained below as well. 

It must be pointed out here that viruses avoid inducing rapid and direct toxicity to cells 
they have entered as this is deleterious to viral replication. Because the cell must be 
viable for the production of new viruses, many viruses do not induce cell death until 
late in their infections if at all. Indeed, infection of hosts by many insect small RNA 
viruses different from HaSV does not result in any discernible response in the host 
insect upon its being infected. These innocuous viruses can still be Exploited, 
however, because the viruses are able to perform the previously mentioned functions 
of encapsidation and protection of the labile RNA genome, ability of the particles to 
enter host cells, uncoating and subsequent translation of the RNA genome. 
Appropriate placement of toxin sequences with those of the coat proteins of these 
viruses as described in this invention can result in toxicity and subsequent control of 
the insect pest host. 

Encapsidation. Encapsidation is defined here as the process of forming a virus 
particle or a virus-like particle that incorporates RNA into its interior. For this process 
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to occur, interactions among the capsid proteins (encoded by the RNA) and between 
the proteins and specific regions of the RNA must occur that result in the ordered 
aggregation of 240 copies of the coat protein and the RNA strand(s). The specific 
regions on the RNA are either or both a defined primary sequence or a specific 
5 secondary structure arrived at by a number of primary sequences. 

All three types of interactions, protein/protein, RNA/protein and RNA folding into 
secondary structures, reside in the capsid gene ORF of HaSV. This is shown by the 
following data: 

10 1. When the ORF the HaSV capsid protein is expressed in insect cells with a 

recombinant baculovirus, VLPs can be observed with transmission electron 
microscopy (TEM) in sections of fixed, positively-stained, infected cells. 
They are highly similar in morphology and dimensions to native virions 
observed in gut tissue of diseased insects. The VLP morphology is that of a 
1 5 smooth surfaced sphere with a diameter of 3 5-40 nm identical to virus 

particles observed in diseased tissue. Also noted is that a fraction of the 
particles have dark, electron dense cores. The fraction of VLPs having 
electron dense cores is smaller than that observed for virus particles observed 
in diseased tissue. The electron dense cores indicate that the particles contain 
20 electron dense, non proteinaceous RNA. 

Methods and Materials: A recombinant baculovirus able to express p71 
was made by the following procedure. An amplicon containing the p71 
ORF was obtained from a PCR reaction made with HP64NEUK 
(GGCCGGATCCAGACATGCTGGAGTGGCGTCAC) and HVP6C2 
25 (GGGATCCCT AATTGGCACGAGCGGCGC) off a DNA template 

consisting of the pT7T2p71 plasmid used for the bacterial expression of 
the capsid gene. This fragment contains the p71 ORF flanked with BamHI 
sites and the initiating AUG placed behind a more favourable context for 
expression in eukaryotes. This fragment was restricted with BamHI and 
30 cloned into the baculovirus expression vector, pVL941, which was 

transfected into Sf9 cells with linearized wildtype baculovirus DNA. A 
recombinant baculovirus, Bacp71, was obtained by standard means (King 
andPossee, 1992). 

For TEM, Sf9 cells infected for three days with Bacp71, were harvested, 
3 5 fixed with gluteraldhyde^ embedded in LRWhite resin and sectioned for 
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examination with TEM using standard procedures. The sections were 
examined with a JEOL 100CX transmission electron microscope. 

When the VLPs are purified and characterised, they also show highly similar 
characteristics to native HaSV virions. Buoyant densities of the particles are 
similar 1 .296 g/ml for virions and 1 .29 g/ml for VLPs. Particle morphologies 
as seen by negative stained TEM are highly similar with the micrographs for 
both virions and VLPs showing spheres of 35-40 nm spheres possessing 
electron dense interiors. The electron dense interiors also indicate the 
presence of RNA. 

Methods and materials. After a three day infection with Bacp7 1 in a 220 
cm 2 flask, Sf9 cells were harvested, pelleted, washed, and lysed with a 
freeze-thaw cycle in 10 ml of buffer. This was centrifuged at 10,000 x g 
for 10 min and the supernatant was recovered and recpntrifuged on top of a 
30% sucrose cushion. The pellet was then redissolved in 0.5 ml of buffer 
(50mM Tris pH7.5, 5 mM CaCl) overnight and placed on top of a solution 
in a SW41 tube (Beckman) consisting of 5 mis each of 60% and 30% of 
CsCl in buffer. This was centrifuged overnight at 40,000 rpm and the sole 
band located in the middle of the tube was removed, placed into 10 ml of 
buffer and pelleted by centrifugation at 35,000 rpm in a SW41 rotor. The 
pellet was dissolved in 50 m 1 of buffer. 

VLPs in the solution were examined on a JEOL 100CX microscope with 
standard procedures after negative staining with uranyl acetate.. 

The VLPs contained RNA. The above observations indicated that the VLPs 
in the were electron dense and that RNA was within the VLPs. These 
observations were confirmed when the VLPs were extracted for RNA and the 
RNA analysed by agarose gel electrophoresis. The RNA was 2900 bases in 
length, the expected size of the mRNA transcribed from the baculoviral 
genome. When the RNA was probed with a radioactively labelled probe 
specific for p71 sequences, strong hybridisation occurred, showing that the 
RNA was the p71 mRNA. 

Methods and materials. RNA was removed from the purified VLPs with 
extraction with phenol, phenol/CH3Cl and CH3C1 and ethanol 
precipitation. This was then run on an 1% formaldehyde agarose gel. A 
northern blot of the gel was done with standard procedures. The P- 
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labelled probe was prepared from clone HR326 which contains the p71 
sequence. 

4. The p71 mRNA extracted from the VLPs are shown to have heterologous, 

5 non-HaSV sequences. This is shown by the recovered RNA strand having a 

length of 2900 bases which is equal to the predicted size of the mRNA 
transcribed from the baculoviral genome and expressing the p71 ORF. Only 
1900 bases of the mRNA are sequences from the p71 ORF, the rest being 
sequences from the baculovirus expressing the gene. When a radioactively 
10 labelled probe specific for the polyhedrin gene located between the p71 gene 

the polyadenylation signal for the polyhedrin gene was hybridized to the RNA 
extracted from the VLPs, a strong hybridation signal was seen on the autorad. 
This shows that the signals required for encapsidating RNA are present in the 
p71 ORF and that non-HaSV sequences can be placed inside the VLP. 
1 5 Methods and materials. RNA extracted from VLPs were northern blotted 

by standard procedures and probed with a labelled 950 bp Hind III 
fragment from the baculoviral transfer vector pVL941 having sequences 3 5 
to the inserted p71 gene and 5' to the polyadenylation site of the 
polyhedrin gene. 

20 

5. The RNA having the capsid protein ORF is specifically encapsidated. No 

other RNA except for the p71 mRNA is present in the VLPs. This is shown 
by the failure of another highly transcribed region of the baculoviral genome 
(therefore also present in great abundance inside the baculoviral infected cell) 
25 failing to hybridise to the VLP RNA. When a probe specific for the plO 

mRNA, a late gene product from the baculovirus, is hybridised to the RNA 
extracted from the purified VLPs, no signal occurs on the horthern blot. 
Methods and materials. RNA extracted from VLPs were northern blotted 
by standard procedures and probed with a 32 P-labelled 245 bp Xba I/Eco 
30 RI fragment from the baculoviral transfer vector pAcUW3 1 having 

sequences V to the start of transcription of the pi 0 promoter for AcMNPV 
baculovirus. 

Thus RNA sequences contained within the ORF of HaSV capsid protein (p71) can 
35 produce VLPs that encapsidate only^RNAs having the p71 sequence. If there are 
exogenous sequences not from HaSV also on the p71 mRNA these then can be 
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encapsidated. This shows that sequences for toxins can be placed within the VLP if it 
possesses a p71 sequence. 

Protection. Normally labile RNA contained within the capsovector must be protected 
5 from degradation during the period between its encapsidation inside the cell where it 
was produced and its entry into the cell where it will effect toxicity. This is 
particularly important because part of this period is spent in the insect gut that is a 
highly degradative milieu for both RNA and protein. This function is performed by 
the 240 copies of the capsid protein which form a protective shell around the virion 
10 RNA. 

The protective properties of the HaSV coat protein for both the virion and the VLP 
was shown by western blots of the capsid protein from HaSV particles, HaSV VLPs, 
and lipophorin before and after timed exposure to the gut contents of Miothis larvae: 
1 5 The data shows that: 

1 . When a non-viral globular protein like lipophorin is exposed to the contents 
of the heliothis midgut, rapid degradation of the protein occurs within 10 minutes. 

2. When either an HaSV virion or VLP is exposed in the same manner, minimal 
degradation of the protein occurs despite extended exposure times (>2 hr). 

20 Thus, when translated, the RNA sequences contained within the p71 ORF lead to 
protection of the protein and RNA of the VLP. 

Methods and materials. The midguts of fifth instar Heliothis armigera were 
excised, their contents removed and centrifuged at 14,500 x g for 15 min. 
Into 10 ml of the contents were placed 1 ml of a solution having 1 mg protein 

25 of either lipophorin, HaSV virions or HaSV VLPs. At timed intervals 1 ml of 

this solution was removed and immediately boiled in SDS-PAGE sample 
buffer. SDS-PAGE and immunoblots with the respective afttisera were then 
carried out with standard procedures. 

30 Cell Entry. Studies of animal host/virus systems have shown that entry of virions into 
cells is mediated by cellular receptors located on their exteriors and viral acceptor 
proteins or VAPs (Lentz, 1990). The HaSV VAP is by exclusion of any other 
possibility, p71, .it being the sole protein component of the HaSV capsid (Hanzlik et 
al, 1993, 1995). In particular, it is believed that a specific region of p71, between 

35 residue 274 and 439 of the protein sequence, is responsible for the binding of HaSV to 
the presently unknown host cell receptor. 
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entry into a new host cell. The maturation cleavage is mediated by the encapsidated 
RNA in the interior of the capsid (Wery et al, 1994). The maturation cleavage 
displays itself in HaSV by the appearance of a 64 kDa protein and the disappearance 
the 71kDa precursor similar to the nodaviruses (Gallagher and Rueckert, 1988)). 

5 

That this process occurs for HaSV VLPs is shown by observing an immunoblot of 
extracts of cells infected with Bacp71 and expressing p71 and proteins extracted from 
purified HaSV virions and purified HaSV VLPs. The blot shows that for the cell 
extract, p71 is expressed in the majority with minor expression of lower molecular 

10 weight products that are presumed degradation products. For proteins extracted from 
HaSV virions, the 64 kDa cleavage produce is present in the great majority with only a 
minor presence of the 71 kDa precursor. For proteins from VLPs, the 64 kDa and 71 
kDa proteins are present approximately equal amounts showing that the maturation 
cleavage does occur although not as efficiently as with viral RNApresent inside the 

15 particles. 

Thus HaSV VLPs are in a condition to be uncoated when they enter a host cell. This 

process is mediated by RNA sequences in the coat protein ORF. 

Methods and materials. An extract of cells infected with Bacp7 1 for three 

2 

20 days was made by pelleting cells from a 25 cm flask and lysing them in 

phosphate-buffered saline with freeze-thaw. SDS-PAGE and immunoblotting 
with anti-sera against HaSV was performed on the cell extract and proteins 
from VLPs and virions according to standard procedures. 

25 Translation. As a general rule, ribosomes responsible for translating proteins initiate 
translation by binding to the cap structure of an mRNA then scanning to the first MET 
placed in an appropriate context. This presents an initial difficulty in translating a 
toxin sequence on an RNA from a protein product (the VLP) translated from the 
region of the mRNA where the encapsidation signal resides (see above), 

30 

This difficulty can be dealt with in either of two ways: trans-encapsidation or through 
the use of internal ribosome entry sites (IRESs). 

1 . Trans-encapsidation. Trans-encapsidation is the process whereby an RNA 

strand is produced with the toxin sequence placed in good translatable context before 
35 the p71 encapsidation sequence. Translation of an RNA sequence is not required for 
its encapsidation by proteins produced from another mRNA. In this way a VLP 
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produced from an RNA having a translatable p71 encapsidates an RNA with a 
nontranslatable p71 sequence but possessing the encapsidation signal that ensures the 
encapsidation of the strand with a translatable toxin sequence. 

2. IRESs. IRESs are one of the exceptions to the general rule noted above for 

5 translation They are sequences on an RNA strand that allow ribosomes to bind an 

RNA internally instead of at the 5' cap to initiate protein translation. They are located 
in the 5' regions of picornavirus vRNAs and specialised genes from certain organisms 
(Jackson et al, 1994)). They have been employed to allow the translation of two 
proteins from the same mRNA (Lipsick and Smarda, 1993) and thus can be employed 

10 to express a toxin from a sequence located, after a translatable p71. An advantage to 
this approach is that many IRESs are host specific. If the IRES used in the RNA 
capsovector is able to function only in the target organism, the toxin Is produced only 
in the target organism and not in the organism producing the RNA capsovector. A 
picornavirus with a targeted pest insect host will possess a suitable IRES for a 

15 capsovector. For example, an IRES from a picornavirus with a Heliothis host will be 
used in a capsovector constructed with p71 from HaSV and a cytotoxin such as the 
ricin A fragement. This particular capsovector will affect only heliothine insects and 
no others as well as not producing a ricin A fragment in the plant producing the 
capsovector. 

20 

Translation into Toxicity. 

Ultimately, all of the above abilities of encapsidation, protection, entiy, and uncoating 
must be induced or accomplished by sequences in the RNA strand that produce the 
25 capsovector. Also important are sequences on the RNA, derived from other viral and 
non-viral sources as well as from HaSV, that are responsible for the toxic activity and 
if required, translation into protein that confer the toxicity. These types of sequences 
will be dealt with in separate sections. 

30 RNA sequences leading to toxicity of the organism or cell can either be translated into 
protein or the sequences cause secondary structures to made on the RNA strand that 
lead to toxicity. 

1 . Toxicity from protein sequences. Both types of toxins, nerve toxins, specific 

for insects and work at the level of the organism (Binnington and Baule, 1993) and 
35 cytotoxins (Stripe and Barbeiri, 1986), toxic only to the cell the capsovector has 

entered, can be used with RNA capsovectors. However, the former type of toxin will 
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have to have a secretion signal appropriate to midgut cells. Examples of these proteins 
are described inMaeda et al, 1991. 

2. Toxicity from RNA secondary structure sequences. RNA secondary 

structures will have the ability to cause toxicity to the cell to which they have been 
5 vectored by RNA capsovectors. These structures are caused by primary sequences; 
however, any number of primary sequences on the RNA strand can lead to the same 
toxic secondary structure. There are three types of these sequence structures that will 
be appropriate for RNA capsovectors: antisense sequences, ribozymes and mimicking 
structures. The first two types are reviewed by Eguchi et al., 1991 arid elsewhere 
10 herein, respectively and aimed at preventing the expression of key cellular enzymes. 
The latter is novel and will be detailed. 

• It is the activity of the HaSV replicase and not virosis or accumulation of viruses 
that causes the midgut cell to cease functioning. This is shown by data generated 
from the following experiment. When protoplasts are transfected with genes that 

1 5 make a replicatable RNA1 and only the capsid protein and not a replicatable RNA2 

(Rl-HC and VCAPB according to procedures listed above), stunting occurs. When 
the stunted larvae are extracted for RNA which is then northern blotted with probe 
for HaSV nucleic acid, only RNA1 of HaSV is seen to be present. Stunting does 
not occur when the protoplasts are transfected with genes that do not make a 

20 replicatable RNA1 (lacking an effective ribozyme to cleave after the last viral base 

in the gene) and only the capsid protein and not a replicatable RNA2 (Rl-HDV and 
VCAPB according to procedures listed elsewhere in patent). When the stunted 
larvae are extracted for RNA which is then northern blotted with probe for HaSV 
nucleic acid, no HaSV RNA is seen to be present. 

25 

• These data are consistent with RNA1 being encapsidated and able to enter midgut 
cells of the larvae. RNA1 is able to self-replicate but not produce virions as there is 
no replicatable RNA2 which has the p71 ORF. The self-replication leads to 
antibiosis due to apoptosis of the midgut cells having the replicating RNA1 . The 

30 particles made in the same manner but having RNA1 not able to self-replicate (due 

to the 3' sequences left by the defective ribozyme) were unable to stunt the larvae. 

• As shown in other systems, the activity of the replicase may be mimicked by 
placing a by-product of replicase activity into the cell. This action "tricks" the cell 
into initiating anti-viral measures such as shut-down of protein synthesis or cell 

35 death by apoptosis (the activity believed to be responsible for the antibiosis caused 

by HaSV). 
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• One such by-product of the replicase's activity is double stranded RNA, an 
intermediate of RNA replication of tetraviruses (du Plessis et al., 1991). Other 
systems have shown that transfection of double stranded RNA into cells causes 
anti-viral measures to be initiated in them. 

• Double-stranded RNA can be delivered to midgut cells by RNA capsovectors by a 
synthetic gene construct which produces a large stem-loop structure when 
transcribed. This structure is made by making the 3' half the reverse complement 
of the 5' half which then self-hybridises into the stem-loop. The RNA can be of 
viral, from either HaSV strand or of non-viral origin. 

Protein Capsovectors. 

Protein capsovectors are VLPs composed of a modified capsid protein or of a mixture 
of the modified capsid protein and the unmodified capsid protein. ^The modified coat 
protein is the coat protein, p71, fused to a fragment of a cytotoxin of either a plant or 
bacterial origin. When expressed, the coat proteins and coat protein-toxin fusions will 
self-assemble into the protein capsovector. Similar to the RNA capsovectors, protein 
capsovectors will not be self-replicating entities like viruses. Upon being eaten by an 
insect pest, the structure of the capsovector will vector the toxin moiety to inside the 
midgut cell by preventing proteolysis of the toxin moiety in the midgut and entering 
the midgut cells in a manner similar to what occurs for a virus particle. Upon entry, 
the capsovector will expose the cytotoxic moiety which will then kill the cell. Large 
numbers of midgut cells killed by capsovectors will cause antibiosis to the feeding 
insects. It is believed that a single gene will be able to express the capsovector. 

In concept, protein capsovectors are similar to the immunotoxins successfully used in 
human cancer therapy where cytotoxic moieties are vectored to specific cells by fusion 
to a specific binding moiety such as antibodies or cytokines. By themselves, the 
cytotoxin fragments do not display toxicity even when injected intravenously. Only 
when they are attached or fused to a binding element does cytotoxicity occur to those 
cells to which the element binds. The binding element of the protein capsovector is 
the VAP part (see above) of the HaSV VLP which is able to bind and enter midgut 
cells of heliothis larvae. 

However, the protein capsovector is^ distinct from immunotoxins in that its structure 
also protects the toxin moiety from degradation in addition to binding to the midgut 
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cell. The toxin fragment will be contained within the capsid shell until the capsovector 
enters the midgut cell Also, it is the capacity of capsovectors to protect the toxin 
moiety from degradation in the midgut lumen that makes it distinct from other insect 
control factors that are fused to elements that only "interact" with midgut cells. 

5 

In the following sections, each of the two elements of a capsovector, the cytotoxic 
fragment and a viral coat protein gene, will be described followed by descriptions of 
how the capsovector are constructed with data supporting their feasibility. 

10 The toxin. Several cytotoxin fragments suitable for protein capsovectors are available 
in readily accessible form. , They have extensive literature describing their activity in 
various fusions for use in immunotoxins (Thorpe et al., 1982). For expression in 
plants, plant-derived fragments of proteinaceous toxins, such as ricin A fragment, 
which are not toxic to plant cells but toxic to animal cells will be the most suitable. 

1 5 For expression in microorganisms, toxins of bacterial origin may be the most suitable, 
it being that they are not toxic to the microorganism producing the protein 
capsovectors. 

As described in the section on RNA casp so vectors, p71 has the ability to self-assemble 
20 into VLPs when expressed in various non-host expression systems. Also shown was 
the ability of the VLPs and virions to resist degradation, and bind to, then enter, 
midgut cells. The critical question concerning the feasibility of the pr otein 
capsovector using p71, the HaSV coat protein, is the ability of VLPs to form with the 
modified coat proteins fused to toxins. 

25 

Construction of model protein capsovectors with capsid protein, p71 and a 
reporter peptide. The cytotoxin fragment can be fused to p71 in a ^number of ways. 
In order to test the possibilities, a reporter peptide fragment was used in place of the 
toxins. This allows a more rapid characterisation of the products as immunodetection 

30 of the exogenous fragment is facilitated by a commercially available monoclonal 

antibody specific for the fragment (IBI FLAG Biosystem). Two constructs were made 
using standard techniques. The FlagT construct placed the reporter fragment, sized 
2501 Da, at the C-terminus of p71 (Fig. 8). The FlagM construct placed a reporter 
fragment, sized 1243 Da, in the middle of th6 p71 sequence near the site where p71 is 

35 cleaved into p64 and p7 (Fig. 8). Both constructs should have resulted in the reporter 
fragment being placed into the interior of the VLP where p7 is located. 
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Formation of VLPs made with modified p71 Recombinant baculoviruses were 
made with the modified p71 genes by standard techniques (King and Possee, 1992) 
and used to infect Sf9 cells. When these cells were processed and examined as before 
with TEM, particles highly similar to VLPs made with unmodified p7 L and HaSV 
virions were observed. The particles were purified on CsCl gradients and were 
examined with SDS-PAGE, immunoblots, negatively stained TEM. Ia addition, the 
particles were tested for their ability to protect the FLAG epitope on the fused peptide 
fragment. These experiments showed: 

1 . The purified particles were highly similar to HaSV virions and VLPs made 
with unmodified p71 and showed the characteristic 35-40 nm diameter 
spheres with a fraction having the electron dense cores. The FlagT particle 
possessed a buoyant density of 1.3 1 g/ml compared to 1.29 g/ml for 
unmodified VLPs and virions. / 

2. Proteins extracted from the particles and observed with SDS-PAGE had the 
molecular weights predicted from the constructs (1243 Da and 2501 Da for 
FlagM and FlagT respectively. In addition, post-assembly cleavage into p64 
was observed for the FlagM particle. No processing was seen in the FlagT 
particle. 

3 . Proteins extracted from the particles reacted with both the anti-p71 antisera 
and with the FLAG monoclonal antibody on immunoblots. 

4. The FLAG epitope was protected during exposure to heliothis midgut 
contents. This is shown by the FLAG epitope remaining at the original 
molecular weight and therefore undegraded. 

Hybrid VLPs. Although VLPs can be made with only modified p71 fused to the 
reporter peptide, and protection of the exogenous reporter peptide bccurs, a protein 
capsovector made with both native p71 and modified p71 fused to a cytotoxin may 
function better. At present it is not clear what properties of the "native" VLP, if any, 
are altered with the addition of the exogenous, fused peptides to p71 . If any 
deleterious properties arise such as poor stability of the particle, a resolution to the 
problem will be to produce a hybrid particle. This will minimise any disruption of 
desirable properties of the native VLP by the modified p71 . 

Hybrid VLP expression in plants.. Three ways can be envisioned to express, either 
a transgenic plant, the hybrid capsovector which requires two distinct, but closely 
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related proteins. The most obvious is two insert two genes into the plant, one for each 
protein. However, the use of elements from plant viruses can make it possible to 
express a capsovector from a single gene. These elements are suppressible stop 
contexts and frame-shift sequences that are detailed by Sleat and Wilson (1992). The 
use of these elements make it feasible to precisely regulate the ratio of coat protein to 
coat protein-toxin fusion. 
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SEQUENCE LISTING 



GENERAL INFORMATION: 
(i) APPLICANT: Commonwealth Scientific and Industrial 

Research Organisation and 
Pacific Seeds Pty. Ltd. 

(ia) INVENTORS: P. D. CHRISTIAN, K. H. J . GORDON and 

T. N. HANZLIK 

(ii) TITLE OF INVENTION: INSECT VIRUSES AND THEIR USES IN 

PROTECTING PLANTS 

(iii) NUMBER OF SEQUENCES: 5 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

( E ) COUNTRY : AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 13 AUGUST 19 93 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 
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(A) NAME: JOHN M. S LATTERY 

(B) REGISTRATION NUMBER: NA 

(C) REFERENCE /DOCKET NUMBER: 1613611 

5 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (613) 254 2777 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GGATCCACAG NNN 



20 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



ATGGGCGATG CCGGCGTCGC GTTCACAG 



35 



125 



(2} INFORMATION FOR SEQ ID NO: 3: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

10 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATGGAGGATG CTGGAGTGGC GTCACAG 



15 



(2) INFORMATION FOR SEQ ID NO: 4: 

20 (l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(11) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

30 

ATGAGCGAGG CCGGCGTCGC GTCACAG 27 
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(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 



(11) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID 



10 



CCATCGATGC CGGACTGGTA TCCCAGGGGG 
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(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCATCGATGC CGGACTGGTA TCCCGAGGGA C 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CCATCGATGA TCCAGCCTCC TCGCGGCGCC GGATGGGCA 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GGAGATCTAC ATATGGGAGA TGCTGGAGTG 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GTAGCGAACG TCGAGAA 



{2) INFORMATION FOR SEQ ID NO: 19: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GGGGGATCCT CAGTTGTCAG TGGCGGGGTA G 



[2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(lij MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 



GGGGATCCCT AATTGGCACG AGCGGCGC 



(2) INFORMATION FOR SEQ ID NO: 21: 



(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AATTACATAT GGCGGCCGCC GTTTCTGCC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 
{C! STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
AATTACATAT GTTCGCGGCC GCCGTTTCT 



(2) INFORMATION FOR SEQ ID NO: 23: 



136 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein - N terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

10 

Phe Ala Ala Ala Val Ser Ala Phe Ala Ala Asn Met Leu Ser Ser Val 
15 10 15 

15 Leu Lys Ser 



20 (2} INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - internal 
30 (XI) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Pro Thr Leu Val Asp Gin Gly Phe Trp He Gly Gly Gin Tyr Ala Leu 
1 5 10 * 15 
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Thr Pro Thr Ser 
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(2) INFORMATION FOR SEQ ID NO: 25: 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Phe Ala Ala Ala Val Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 26: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GCGCCCCCUG GGAUACCAGG AUC 



(2) INFORMATION FOR SEQ ID NO: 27: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
TCAGCAGGTG GCATAGG 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 6 . . 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



CCCAT ATG GGC GAT GCC GGC GTC GCG TCA CAG 
Met Gly Asp Ala Gly Val Ala Ser Gin 
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(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 



(2} INFORMATION FOR SEQ ID NO: 30: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 6.-32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CCCAT ATG AGC GAG GCC GGC GTC GCG TCA CAG 
Met Ser Glu Ala Gly Val Ala Ser Gin 
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5 (2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid. 
10 (D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein - N- terminal 



15 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Met Ser Glu Ala Gly Val Ala Ser Gin 
1 5 

20 

(2) INFORMATION FOR SEQ ID NO: 32: 

(1) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..27 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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ATG GGA GAT GCT GGA GTG GCG TCA CAG 
Met Gly Asp Ala Gly Val Ala Ser Gin 
1 5 

5 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GGGGGATCCG TTCTGCCTCC CCGGAC 

(2) INFORMATION FOR SEQ ID NO: 39: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 37.. 5145 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACA ATG TAC GCG AAA GCG ACA 

Met Tyr Ala Lys Ala Thr 



151 



GAA GTC CAG AGG CGC CAC GGC TCC AGC ATT GAG CTG CGC ATC ACT CGC 1014 
Glu Val Gin Arg Arg His Gly Ser Ser lie Glu Leu Arg He Thr Arg 
315 320 325 

5 GCG CCA CCT GGA GAC CGC ATG CTG GCC GTC GTC CCA AGG ACG TCC CAA 1062 

Ala Pro Pro Gly Asp Arg Met Leu Ala Val Val Pro Arg Thr Ser Gin 
330 335 340 

GGC CTC TGC AGA ATC CCA AAC ATC TTT TAT TAC GCC GAC GCG TCG GGC 1110 
10 Gly Leu Cys Arg He Pro Asn He Phe Tyr Tyr Ala Asp Ala Ser Gly 

345 350 355 

ACT GAG CAT AAG ACC ATC CTT ACG TCA CAG CAC AAA GTC AAC ATG CTG 1158 
Thr Glu His Lys Thr He Leu Thr Ser Gin His Lys Val Asn Met Leu 
15 360 365 370 

CTC AAT TTT ATG CAA ACG CGT CCT GAG AAG GAA CTA GTC GAC ATG ACC 12 06 

Leu Asn Phe Met Gin Thr Arg Pro Glu Lys Glu Leu Val Asp Met Thr 

375 380 385 330 

20 

GTC TTG ATG TCG TTC GCG CGC GCT AGG CTG CGC GCG ATC GTG GTC GCC 12 54 

Val Leu Met Ser Phe Ala Arg Ala Arg Leu Arg Ala He Val Val Ala 
395 400 405 

25 TCA GAA GTC ACC GAG AGC TCC TGG AAC ATC TCA CCG GCT GAC CTG GTC 1302 

Ser Glu Val Thr Glu Ser Ser Trp Asn He Ser Pro Ala Asp Leu Val 
410 415 420 

CGC ACT GTC GTG TCT CTT TAC GTC CTC CAC ATC ATC GAG CGC CGA AGG 13 50 

30 Arg Thr Val Val Ser Leu Tyr Val Leu His He He Glu Arg Arg Arg 

425 430 435 

GCT GCG GTC GCT GTC AAG ACC GCC AAG GAC GAC GTC TTT GGA GAG ACT 13 98 

Ala Ala Val Ala Val Lys Thr Ala Lys Asp Asp Val Phe Gly Glu Thr 
35 440 445 450 
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TCG TTC TGG GAG AGT CTC AAG CAC GTC TTG GGC TCC TGT TGC GGT CTG 14 4 6 

Ser Phe Trp Glu Ser Leu Lys His Val Leu Gly Ser Cys Cys Gly Leu 
455 460 465 470 

5 CGC AAC CTC AAA GGC ACC GAC GTC GTC TTT ACT AAG CGC GTC GTC GAT 14 94 

Arg Asn Leu Lys Gly Thr Asp Val Val Phe Thr Lys Arg Val Val Asp 
475 430 485 

AAG TAC CGA GTC CAC TCG CTC GGA GAC ATA ATC TGC GAC GTC CGC CTG 1542 
10 Lys Tyr Arg Val His Ser Leu Gly Asp lie He Cys Asp Val Arg Leu 

490 495 500 

TCC CCT GAA CAG GTC GGC TTC CTG CCG TCC CGC GTA CCA CCT GCC CGC 15 90 

Ser Pro Glu Gin Val Gly Phe Leu Pro Ser Arg Val Pro Pro Ala Arg 
15 505 510 515 

GTC TTT CAC GAC AGG GAA GAG CTT GAG GTC CTT CGC GAA GCT GGC TGC 1638 

Val Phe His Asp Arg Glu Glu Leu Glu Val Leu Arg Glu Ala Gly Cys 

520 525 530 

20 

TAC AAC GAA CGT CCG GTA CCT TCC ACT CCT CCT GTG GAG GAG CCC CAA 168 6 

Tyr Asn Glu Arg Pro Val Pro Ser Thr Pro Pro Val Glu Glu Pro Gin 

535 540 545 550 

25 GGT TTC GAC GCC GAC TTG TGG CAC GCG ACC GCG GCC TCA CTC CCC GAG 17 34 

Gly Phe Asp Ala Asp Leu Trp His Ala Thr Ala Ala Ser Leu Pro Glu 
555 560 565 

TAC CGC GCC ACC TTG CAG GCA GGT CTC AAC ACC GAC GTC AAG CAG CTC 17 82 

30 Tyr Arg Ala Thr Leu Gin Ala Gly Leu Asn Thr Asp Val Lys Gin Leu 

570 575 580 

AAG ATC ACC CTC GAG AAC GCC CTC AAG ACC ATC GAC GGG CTC ACC CTC 18 30 

Lys He Thr Leu Glu Asn Ala Leu Lys Thr He Asp Gly Leu Thr Leu 
35 585 590 595 
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TCC CCA GTC AGA GGC CTC GAG ATG TAC GAG GGC CCG CCA GGC AGC GGC 18 7 8 

Ser Pro Val Arg Gly Leu Glu Met Tyr Glu Gly Pro Pro Gly Ser Gly 
600 605 610 

5 AAG ACG GGC ACC CTC ATC GCC GCC CTT GAG GCC GCG GGC GGT AAA GCA 1926 

Lys Thr Gly Thr Leu lie Ala Ala Leu Glu Ala Ala Gly Gly Lys Ala 
615 620 625 630 

CTT TAC GTG GCA CCC ACC AGA GAA CTG AGA GAG GCT ATG GAC CGG CGG 197 4 

10 Leu Tyr Val Ala Pro Thr Arg Glu Leu Arg Glu Ala Met Asp Arg Arg 

635 640 645 



ATC AAA CCG CCG TCC GCC TCG GCT ACG CAA CAT GTC GCC CTT GCG ATT 
lie Lys Pro Pro Ser Ala Ser Ala Thr Gin His Val Ala Leu Ala lie 
650 655 660 



15 



20 



25 



CTC CGT CGT GCC ACC GCC GAG GGC GCC CCT TTC GCT ACC GTG GTT ATC 

Leu Arg Arg Ala Thr Ala Glu Gly Ala Pro Phe Ala Thr Val Val He 

665 670 675 

GAC GAG TGC TTC ATG TTC CCG CTC GTG TAC GTC GCG ATC GTG CAC GCC 

Asp Glu Cys Phe Met Phe Pro Leu Val Tyr Val Ala He Val His Ala 
680 685 690 

TTG TCC CCG AGC TCA CGA ATA GTC CTT GTA GGG GAC GTC CAC CAA ATC 

Leu Ser Pro Ser Ser Arg He Val Leu Val Gly Asp Val His Gin He 

695 700 705 710 



GGG TTT ATA GAC TTC CAA GGC ACA AGC GCG AAC ATG CCG CTC GTT CGC 
30 Gly Phe He Asp Phe Gin Gly Thr Ser Ala Asn Met Pro Leu Val Arg 

715 720 725 



GAC GTC GTT AAG CAG TGC CGT CGG CGC ACT TTC AAC CAA ACC AAG CGC 
Asp Val Val Lys Gin Cys Arg Arg Arg Thr Phe Asn Gin Thr Lys Arg 
730 735 740 



35 
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TGT CCG GCC GAC GTC GTT GCC ACC ACG TTT TTC CAG AGC TTG TAC CCC 2 310 

Cys Pro Ala Asp Val Val Ala Thr Thr Phe Phe Gin Ser Leu Tyr Pro 
745 750 755 

5 GGG TGC ACA ACC ACC TCA GGG TGC GTC GCA TCC ATC AGC CAC GTC GCC 2358 

Gly Cys Thr Thr Thr Ser Gly Cys Val Ala Ser lie Ser His Val Ala 
760 765 770 

CCA GAC TAC CGC AAC AGC CAG GCG CAA ACG CTC TGC TTC ACG CAG GAG 2 40 6 

10 Pro Asp Tyr Arg Asn Ser Gin Ala Gin Thr Leu Cys Phe Thr Gin Glu 

775 780 785 790 

GAA AAG TCG CGC CAC GGG GCT GAG GGC GCG ATG ACT GTG CAC GAA GCG 24 54 

Glu Lys Ser Arg His Gly Ala Glu Gly Ala Met Thr Val His Glu Ala 
15 795 800 805 

CAG GGA CGC ACT TTT GCG TCT GTC ATT CTG CAT TAC AAC GGC TCC ACA 2 502 

Gin Gly Arg Thr Phe Ala Ser Val He Leu His Tyr Asn Gly Ser Thr 
810 815 820 



20 



GCA GAG CAG AAG CTC CTC GCT GAG AAG TCG CAC CTT CTA GTC GGC ATC 
Ala Glu Gin. Lys Leu Leu Ala Glu Lys Ser His Leu Leu Val Gly lie 
825 830 835 



25 
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ACG CGC CAC ACC AAC CAC CTG TAC ATC CGC GAC CCG ACA GGT GAC ATT 
Thr Arg His Thr Asn His Leu Tyr He Arg Asp Pro Thr Gly Asp He 
840 345 850 

5 GAG AGA CAA CTC AAC CAT AGC GCG AAA GCC GAG GTG TTT ACA GAC ATC 

Glu Arg Gin Leu Asn His Ser Ala Lys Ala Glu Val Phe Thr Asp He 
855 860 865 870 

CCT GCA CCC CTG GAG ATC ACG ACT GTC AAA CCG AGT GAA GAG GTG CAG 
10 Pro Ala Pro Leu Glu He Thr Thr Val Lys Pro Ser Glu Glu Val Gin 

875 880 8G5 

CGC AAC GAA GTG ATG GCA ACG ATA CCC CCG CAG AGT GCC ACG CCG CAC 
Arg Asn Glu Val Met Ala Thr He Pro Pro Gin Ser Ala Thr Pro His 
15 890 895 900 

GGA GCA ATC CAT CTG CTC CGC AAG AAC TTC GGG GAC CAA CCC GAC TGT 
Gly Ala He His Leu Leu Arg Lys Asn Phe Gly Asp Gin Pro Asp Cys 
905 910 915 



20 



GGC TGT GTC GCT TTG GCG AAG ACC GGC TAC GAG GTG TTT GGC GGT CGT 
Gly Cys Val Ala Leu Ala Lys Thr Gly Tyr Glu Val Phe Gly Gly Arg 

920 925 930 



156 



GCC AAA ATC AAC GTA GAG CTT GCC GAA CCC GAC GCG ACC CCG AAG CCG 

Ala Lys lie Asn Val Glu Leu Ala Glu Pro Asp Ala Thr Pro Lys Pro 

935 940 945 950 

5 

CAT AGG GCG TTC CAG GAA GGG GTA CAG TGG GTC AAG GTC ACC AAC GCG 

His Arg Ala Phe Gin Glu Gly Val Gin Trp Val Lys Val Thr Asn Ala 

955 960 965 

10 TCT AAC AAA CAC CAG GCG CTC CAG ACG CTG TTG TCC CGC TAC ACC AAG 

Ser Asn Lys His Gin Ala Leu Gin Thr Leu Leu Ser Arg Tyr Thr Lys 
970 975 980 

CGA AGC GCT GAC CTG CCG CTA CAC GAA GCT AAG GAG GAC GTC AAA CGC 
15 Arg Ser Ala Asp Leu Pro Leu His Glu Ala Lys Glu Asp Val Lys Arg 

985 990 995 

ATG CTA AAC TCG CTT GAC CGA CAT TGG GAC TGG ACT GTC ACT GAA GAC 
Met Leu Asn Ser Leu Asp Arg His Trp Asp Trp Thr Val Thr Glu Asp 
20 1000 1005 1010 

GCC CGT GAC CGA GCT GTC TTC GAG ACC CAG CTC AAG TTC ACC CAA CGC 

Ala Arg Asp Arg Ala Val Phe Glu Thr Gin Leu Lys Pne Thr Gin Arg 
1015 1020 1025 1030 

25 

GGC GGC ACC GTC GAA GAC CTG CTG GAG CCA GAC GAC CCC TAC ATC CGT 

Gly Gly Thr Val Glu Asp Leu Leu Glu Pro Asp Asp Pro Tyr lie Arg 
1035 1040 1045 

30 GAC ATA GAC TTC CTT ATG AAG ACT CAG CAG AAA GTG TCG CCC AAG CCG 

Asp lie Asp Phe Leu Met Lys Thr Gin Gin Lys Val Ser Pro Lys Pro 
1050 1055 1060 



35 



ATC AAT ACG GGC AAG GTC GGG CAG GGG ATC GCC GCT CAC TCA AAG TCT 
lie Asn Thr Gly Lys Val Gly Gin Gly He Ala Ala His Ser Lys Ser 
1065 1070 1075 



3270 
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CTC AAC TTC GTC CTC GCC GCT TGG ATA CGC ATA CTC GAG GAG ATA CTC 3313 
Leu Asn Phe Val Leu Ala Ala Trp He Arg He Leu Glu Glu He Leu 
1030 1085 1090 

5 CGT ACC GGG AGC CGC ACG GTC CGG TAC AGC AAC GGT CTC CCC GAC GAA 336 6 

Arg Thr Gly Ser Arg Thr Val Arg Tyr Ser Asn Gly Leu Pro Asp Glu 
1095 HOO 1105 1110 

GAA GAG GCC ATG CTG CTC GAA GCG AAG ATC AAT CAA GTC CCA CAC GCC 3414 
10 Glu Glu Ala Met Leu Leu Glu Ala Lys He Asn Gin Val Pro His Ala 

1115 1120 1125 



ACG TTC GTC TCG GCG GAC TGG ACC GAG TTT GAC ACC GCC CAC AAT AAC 34 62 

Thr Phe Val Ser Ala Asp Trp Thr Glu Phe Asp Thr Ala His Asn Asn 
15 1130 1135 1140 



20 



ACG AGT GAG CTG CTC TTC GCC GCC CTT TTA GAG CGC ATC GGC ACG CCT 
Thr Ser Glu Leu Leu Phe Ala Ala Leu Leu Glu Arg He Gly Thr Pro 
1145 1150 1155 

GCA GCT GCC GTT AAT CTA TTC AGA GAA CGG TGT GGG AAA CGC ACC TTG 
Ala Ala Ala Val Asn Leu Phe Arg Glu Arg Cys Gly Lys Arg Thr Leu 
1160 1165 1170 



25 CGA GCG AAG GGT CTA GGC TCC GTT GAA GTC GAC GGT CTG CTC GAC TCC 3606 

Arg Ala Lys Gly Leu Gly Ser Val Glu Val Asp Gly Leu Leu Asp Ser 
1175 H80 1185 1190 

GGC GCA GCT TGG ACG CCT TGC CGC AAC ACC ATC TTC TCT GCC GCC GTC 3654 
30 Gly Ala Ala Trp Thr Pro Cys Arg Asn Thr He Phe Ser Ala Ala Val 

1195 1200 1205 



35 



ATG CTC ACG CTC TTC CGC GGC GTC AAG TTC GCA GCT TTC AAA GGC GAC 
Met Leu Thr Leu Phe Arg Gly Val Lys Phe Ala Ala Phe Lys Gly Asp 
1210 1215 1220 



3702 



158 



GAC TCG CTC CTC TGT GGT AGC CAT TAG CTC CGT TTC GAC GCT AGC CGC 37 50 

Asp Ser Leu Leu Cys Gly Ser His Tyr Leu Arg Phe Asp Ala Ser Arg 
1225 1230 1235 

5 CTT CAC ATG GGC GAA CGT TAC AAG ACC AAA CAT TTG AAG GTC GAG GTG 37 98 

Leu His Met Gly Glu Arg Tyr Lys Thr Lys His Leu Lys Val Glu Val 
1240 1245 1250 

CAG AAA ATC GTG CCG TAC ATC GGA CTC CTC GTC TCC GCT GAG CAG GTC 3 8 46 

10 Gin Lys lie Val Pro Tyr He Gly Leu Leu Val Ser Ala Glu Gin Val 

1255 1260 1265 1270 

GTC CTC GAC CCT GTC AGG AGC GCT CTC AAG ATA TTT GGG CGC TGC TAC 38 94 

Val Leu Asp Pro Val Arg Ser Ala Leu Lys He Phe Gly Arg Cys Tyr 
15 1275 1280 1285 

ACA AGC GAA CTC CTT TAC TCC AAG TAC GTG GAG GCT GTG AGA GAC ATC 394 2 

Thr Ser Glu Leu Leu Tyr Ser Lys Tyr Val Glu Ala Val Arg Asp He 
1290 1295 1300 

20 

ACC AAG GGC TGG AGT GAC GCC CGC TAC CAC AGC CTC CTG TGC CAC ATG 3 990 

Thr Lys Gly Trp Ser Asp Ala Arg Tyr His Ser Leu Leu Cys His Met 

1305 1310 1315 

25 TCA GCA TGC TAC TAC AAT TAC GCG CCG GAG TCT GCG GCG TAC ATC ATC 4 0 33 

Ser Ala Cys Tyr Tyr Asn Tyr Ala Pro Glu Ser Ala Ala Tyr He He 
1320 1325 1330 

GAC GCT GTT GTT CGC TTT GGG CGC GGC GAC TTC CCG TTT GAA CAA CTG 4086 
30 Asp Ala Val Val Arg Phe Gly Arg Gly Asp Phe Pro Phe Glu Gin Leu 

1335 1340 1345 1350 

CGC GTG GTG CGT GCC CAT GTG CAG GCA CCC GAC GCT TAC AGC AGC ACG 4134 
Arg Val Val Arg Ala His Val Gin Ala Pro Asp Ala Tyr Ser Ser Thr 
35 1355 1360 1365 



159 



TAT CCG GCT AAC GTG CGC GCA TCG TGC CTT GAC CAC GTC TTC GAG CCC 4182 
Tyr Pro Ala Asn Val Arg Ala Ser Cys Leu Asp His Val Phe Glu Pro 
1370 1375 1330 

5 CGC CAG GCC GCC GCC CCG GCA GGT TTC GTT GCG ACA TGT GCG AAG CCG 4230 

Arg Gin Ala Ala Ala Pro Ala Gly Phe Val Ala Thr Cys Ala Lys Pro 
1365 1390 1395 

GAA ACG CCT TCT TCA CTT ACC GCG AAA GCT GGT GTT TCT GCG ACT ACA 4 27 3 

10 Glu Thr Pro Ser Ser Leu Thr Ala Lys Ala Gly Val Ser Ala Thr Thr 

1400 1405 1410 

AGC CAC GTT GCG ACT GGG ACT GCG CCC CCG GAG TCT CCA TGG GAT GCA 4 32 6 

Ser His Val Ala Thr Gly Thr Ala Pro Pro Glu Ser Pro Trp Asp Ala 
15 1415 1420 1425 1430 

CCT GCA GCC AAC AGC TTT TCG GAG TTA TTG ACA CCG GAG ACC CCG TCC 437 4 

Pro Ala Ala Asn Ser Phe Ser Glu Leu Leu Thr Pro Glu Thr Pro Ser 
1435 1440 1445 

20 

ACA TCA TCC TCG CCG TCA TCG TCT TCA TCG GAC TCC TCT ACA TCG TGT 4 4 22 

Thr Ser Ser Ser Pro Ser Ser Ser Ser Ser Asp Ser Ser Thr Ser Cys 
1450 1455 1460 

25 GGA AGG TCG CTC AGT GGT GGA GAC ACC GCA AGG ACC ACA GAA GAC TTG 4 470 

Gly Arg Ser Leu Ser Gly Gly Asp Thr Ala Arg Thr Thr Glu Asp Leu 
1465 1470 1475 

AAC AGC AGA AAG CCG CCT TCG CAA GAC AGG CAA TCA CGC TCG TCT GAA 4 518 

30 Asn Ser Arg Lys Pro Pro Ser Gin Asp Arg Gin Ser Arg Ser Ser Glu 

1480 1485 1490 

TGT CTG GAC AGA AGC GGA GAA AGG ACA GGC AGT TCG TTA ACT GCC CCC 4 566 

Cys Leu Asp Arg Ser Gly Glu Arg Thr Gly Ser Ser Leu Thr Ala Pro 
35 1495 1500 1505 1510 



160 



ACT GCT CCG AGC CCC TCA TTC TCA TTT TCG GAA AGA GCT CGA CTG GCG 

Thr Ala Pro Ser Pro Ser Phe Ser Phe Ser Glu Arg Ala Arg Leu Ala 
1515 1520 1525 

5 ACC GGG CCG ACT GTC GCC GCT GCG ACA TCA CCT TCG GCA ACC CCA TCC 

Thr Gly Pro Thr Val Ala Ala Ala Thr Ser Pro Ser Ala Thr Pro Ser 
1530 1535 1540 

TGC GCC ACG GAC CAG GTT GCC GCG AGG ACC ACG CCG GAC TTT GCG CCT 
10 Cys Ala Thr Asp Gin Val Ala Ala Arg Thr Thr Pro Asp Phe Ala Pro 

1545 1550 1555 



TTC CTG GGT TCC CAG TCT GCC CGT GCT GTC TCG AAG CCG TAC CGG CCC 
Phe Leu Gly Ser Gin Ser Ala Arg Ala Val Ser Lys Pro Tyr Arg Pro 
1560 1565 1570 



15 



20 



25 



30 



CCC ACG ACT GCC CGT TGG AAA GAA GTC ACC CCG CTC CAC GCG TGG AAG 4 806 

Pro Thr Thr Ala Arg Trp Lys Glu Val Thr Pro Leu His Ala Trp Lys 
1575 1580 1585 1590 

GGC GTG ACC GGA GAC CGA CCG GAA GTC AGG GAG GAC CCG GAG ACA GCG 43 54 

Gly Val Thr Gly Asp Arg Pro Glu Val Arg Glu Asp Pro Glu Thr Ala 
15S5 1600 1605 

GCG GTC GTC CAG GCT CTG ATC AGC GGC CGT TAT CCT CAG AAG ACG AAG 4 902 

Ala Val Val Gin Ala Leu lie Ser Gly Arg Tyr Pro Gin Lys Thr Lys 



CTT TCC TCC GAC GCA TCC AAA GGC TAC TCA AGA ACT AAG GGA TGC TCA 
Leu Ser Ser Asp Ala Ser Lys Gly Tyr Ser Arg Thr Lys Gly Cys Ser 
1625 1630 1635 



CAA TCC ACC TCT TTT CCT GCC CCG AGT GCG GAT TAC CAG GCC CGC GAC 
Gin Ser Thr Ser Phe Pro Ala Pro Ser Ala Asp Tyr Gin Ala Arg Asp 
35 1640 1645 1650 



161 



TGC CAG ACA GTC CGA GTC TGC CGC GCC GCT GCA GAG ATG GCG CGC TCA 5 04 6 

Cys Gin Thr Val Arg Val Cys Arg Ala Ala Ala Glu Met Ala Arg Ser 
1655 1660 1665 1670 

5 TGT ATT CAC GAG CCG TTG GCT TCA TCT GCC GCC AGT GCC GAC TTG AAG 50 94 

Cys lie His Glu Pro Leu Ala Ser Ser Ala Ala Ser Ala Asp Leu Lys 
1675 1680 1635 



10 



CGC ATA CGC TCT ACC TCG GAC TCT GTT CCC GAT GTA AAG ATC AGC AAG 
Arg lie Arg Ser Thr Ser Asp Ser Val Pro Asp Val Lys lie Ser Lys 
1690 1695 1700 



5142 



162 



5 



20 



35 



AGC GCA TGAAGGAACA AAATTAGTTT CCTTGTTCGT AAACAAGGTG GTCCCTCCCA 5198 
Ser Ala 



TTGAGGTAAA GACTCTGGTG AGTCCTCAAC GTTACTCGTT GAGTCTGCTG CGGTTCGATT 5253 



CCATTCCCAA GCAGCAAAGG GTGCGCAACT AGTACGGCGC CCCCTGGGAT ACCA 



10 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1704 ammo acids 

(B) TYPE: ammo acid 
(D) TOPOLOGY: linear 



(li) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Met Tyr Ala Lys Ala Thr Asp Val Ala Arg Val Tyr Ala Ala Ala Asp 
25 1 5 10 15 

Val Ala Tyr Ala Asn Val Leu Gin Gin Arg Ala Val Lys Leu Asp Phe 
20 25 30 

30 Ala Pro Pro Leu Lys Ala Leu Glu Thr Leu His Arg Leu Tyr Tyr Pro 

35 40 45 

Leu Arg Phe Lys Gly Gly Thr Leu Pro Pro Thr Gin His Pro lie Leu 
50 55 60 



Ala Gly His Gin Arg Val Ala Glu Glu Val Leu His Asn Phe Ala Arg 



163 



Gly Arg Ser Thr Val Leu Glu He Gly Pro Ser Leu His Ser Ala Leu 
B5 90 95 

Lys Leu His Gly Ala Pro Asn Ala Pro Val Ala Asp Tyr His Gly Cys 
100 105 110 

Thr Lys Tyr Gly Thr Arg Asp Gly Ser Arg His He Thr Ala Leu Glu 
115 120 125 

Ser Arg Ser Val Ala Thr Gly Arg Pro Glu Phe Lys Ala Asp Ala Ser 
130 135 140 

Leu Leu Ala Asn Gly He Ala Ser Arg Thr Phe Cys Val Asp Gly Val 
145 150 155 160 

Gly Ser Cys Ala Phe Lys Ser Arg Val Gly He Ala Asn His Ser Leu 
165 1^0 175 

Tyr Asp Val Thr Leu Glu Glu Leu Ala Asn Ala Phe Glu Asn His Gly 
180 135 190 

Leu His Met Val Arg Ala Phe Met His Met Pro Glu Glu Leu Leu Tyr 
195 200 205 

Met Asp Asn Val Val Asn Ala Glu Leu Gly Tyr Arg Phe His Val He 
210 215 220 

Glu Glu Pro Met Ala Val Lys Asp Cys Ala Phe Gin Gly Gly Asp Leu 
225 230 235 240 

Arg Leu His Phe Pro Glu Leu Asp Phe He Asn Glu Ser Gin Glu Arg 
245 250 255 

Arg He Glu Arg Leu Ala Ala Arg Gly Ser Tyr Ser Arg Arg Ala Val 



164 



He Phe Ser Gly Asp Asp Asp Trp Gly Asp Ala Tyr Leu His Asp Phe 
275 280 285 

His Thr Trp Leu Ala Tyr Leu Leu Val Arg Asn Tyr Pro Thr Pro Phe 

290 295 300 

Gly Phe Ser Leu His He Glu Val Gin Arg Arg His Gly Ser Ser He 
305 310 315 320 

Glu Leu Arg He Thr Arg Ala Pro Pro Gly Asp Arg Met Leu Ala Val 
325 330 335 

Val Pro Arg Thr Ser Gin Gly Leu Cys Arg He Pro Asn He Phe Tyr 
340 345 350 

Tyr Ala Asp Ala Ser Gly Thr Glu His Lys Thr He Leu Thr Ser Gin 
355 360 365 

His Lys Val Asn Met Leu Leu Asn Phe Met Gin Thr Arg Pro Glu Lys 
370 375 380 

Glu Leu Val Asp Met Thr Val Leu Met Ser Phe Ala Arg Ala Arg Leu 
385 390 395 400 

Arg Ala He Val Val Ala Ser Glu Val Thr Glu Ser Ser Trp Asn He 
405 410 415 

Ser Pro Ala Asp Leu Val Arg Thr Val Val Ser Leu Tyr Val Leu His 
420 425 430 

He He Glu Arg Arg Arg Ala Ala Val Ala Val Lys Thr Ala Lys Asp 
435 440 4 45* 

Asp Val Phe Gly Glu Thr Ser Phe Trp Glu Ser Leu Lys His Val Leu 



165 



450 455 460 

Gly Ser Cys Cys Gly Leu Arg Asn Leu Lys Gly Thr Asp Val Val Phe 
465 470 475 480 

Thr Lys Arg Val Val Asp Lys Tyr Arg Val His Ser Leu Gly Asp He 
485 490 495 

He Cys Asp Val Arg Leu Ser Pro Glu Gin Val Gly Phe Leu Pro Ser 
500 505 510 

Arg Val Pro Pro Ala Arg Val Phe His Asp Arg Glu Glu Leu Glu Val 
515 520 525 

Leu Arg Glu Ala Gly Cys Tyr Asn Glu Arg Pro Val Pro Ser Thr Pro 
530 535 540 

Pro Val Glu Glu Pro Gin Gly Phe Asp Ala Asp Leu Trp His Ala Thr 
545 550 555 560 

Ala Ala Ser Leu Pro Glu Tyr Arg Ala Thr Leu Gin Ala Gly Leu Asn 
565 570 575 

Thr Asp Val Lys Gin Leu Lys He Thr Leu Glu Asn Ala Leu Lys Thr 
580 535 590 

He Asp Gly Leu Thr Leu Ser Pro Val Arg Gly Leu Glu Met Tyr Glu 
595 600 605 

Gly Pro Pro Gly Ser Gly Lys Thr Gly Thr Leu He Ala Ala Leu Glu 
610 615 620 

Ala Ala Gly Gly Lys Ala Leu Tyr Val Ala Pro Thr Arg Glu Leu Arg 
625 630 635 640 

Glu Ala Met Asp Arg Arg He Lys Pro Pro Ser Ala Ser Ala Thr Gin 



166 



645 650 655 

His Val Ala Leu Ala He Leu Arg Arg Ala Thr Ala Glu Gly Ala Pro 

660 665 670 

Phe Ala Thr Val Val He Asp Glu Cys Phe Met Phe Pro Leu Val Tyr 
675 680 685 

Val Ala He Val His Ala Leu Ser Pro Ser Ser Arg He Val Leu Val 
690 695 700 

Gly Asp Val His Gin He Gly Phe He Asp Phe Gin Gly Thr Ser Ala 
705 710 715 720 

Asn Met Pro Leu Val Arg Asp Val Val Lys Gin Cys Arg Arg Arg Thr 
725 730 735 

Phe Asn Gin Thr Lys Arg Cys Pro Ala Asp Val Val Ala Thr Thr Phe 
740 745 750 

Phe Gin Ser Leu Tyr Pro Gly Cys Thr Thr Thr Ser Gly Cys Val Ala 
755 760 765 

Ser He Ser His Val Ala Pro Asp Tyr Arg Asn Ser Gin Ala Gin Thr 
770 775 780 

Leu Cys Phe Thr Gin Glu Glu Lys Ser Arg His Gly Ala Glu Gly Ala 
785 790 795 BOO 

Met Thr Val His Glu Ala Gin Gly Arg Thr Phe Ala Ser Val He Leu 
805 810 815 

His Tyr Asn Gly Ser Thr Ala Glu Gin Lys Leu Leu Ala Glu Lys Ser 
820 825 830 

His Leu Leu Val Gly He Thr Arg His Thr Asn His Leu Tyr He Arg 



167 



335 



Asp Pro Thr Gly Asp lie Glu Arg Gin Leu Asn His Ser Ala Lys Ala 
850 855 860 

Glu Val Phe Thr Asp He Pro Ala Pro Lea Glu He Thr Thr Val Lys 
865 870 875 880 

Pro Ser Glu Glu Val Gin Arg Asn Glu Val Met Ala Thr He Pro Pro 

885 890 895 

Gin Ser Ala Thr Pro His Gly Ala He His Leu Leu Arg Lys Asn Phe 
900 905 910 

Gly Asp Gin Pro Asp Cys Gly Cys Val Ala Leu Ala Lys Thr Gly Tyr 
915 920 925 

Glu Val Phe Gly Gly Arg Ala Lys He Asn Val Glu Leu Ala Glu Pro 
930 935 940 

Asp Ala Thr Pro Lys Pro His Arg Ala Phe Gin Glu Gly Val Gin Trp 
945 950 955 960 

Val Lys Val Thr Asn Ala Ser Asn Lys His Gin Ala Leu Gin Thr Leu 
965 970 975 

Leu Ser Arg Tyr Thr Lys Arg Ser Ala Asp Leu Pro Leu His Glu Ala 
980 985 990 

Lys Glu Asp Val Lys Arg Met Leu Asn Ser Leu Asp Arg His Trp Asp 

995 1000 1005 



Trp Thr Val Thr Glu Asp Ala Arg Asp Arg Ala Val Phe Glu Thr Gin 
1010 1015 1020 



Leu Lys Phe Thr Gin Arg Gly Gly Thr Val Glu Asp Leu Leu Glu Pro 



168 



Asp Asp Pro Tyr lie Arg Asp He Asp Phe Leu Met Lys Thr Gin Gin 
1045 1050 1055 

Lys Val Ser Pro Lys Pro He Asn Thr Gly Lys Val Gly Gin Gly He 
1060 1065 1070 

Ala Ala His Ser Lys Ser Leu Asn Phe Val Leu Ala Ala Trp He Arg 
1075 1080 1085 

He Leu Glu Glu He Leu Arg Thr Gly Ser Arg Thr Val Arg Tyr Ser 
1090 1095 1100 

Asn Gly Leu Pro Asp Glu Glu Glu Ala Met Leu Leu Glu Ala Lys He 
1105 1110 1115 1120 

Asn Gin Val Pro His Ala Thr Phe Val Ser Ala Asp Trp Thr Glu Phe 
1125 1130 1135 

Asp Thr Ala His Asn Asn Thr Ser Glu Leu Leu Phe Ala Ala Leu Leu 
1140 1145 1150 

Glu Arg He Gly Thr Pro Ala Ala Ala Val Asn Leu Phe Arg Glu Arg 
1155 1160 1165 

Cys Gly Lys Arg Thr Leu Arg Ala Lys Gly Leu Gly Ser Val Glu Val 
1170 1175 1180 

Asp Gly Leu Leu Asp Ser Gly Ala Ala Trp Thr Pro Cys Arg Asn Thr 
1185 1190 1195 1200 

He Phe Ser Ala Ala Val Met Leu Thr Leu Phe Arg Gly Val Lys Phe 
1205 1210 " 1215 

Ala Ala Phe Lys Gly Asp Asp Ser Leu Leu Cys Gly Ser His Tyr Leu 



169 
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Arg Phe Asp Ala Ser Arg Leu His Met Gly Glu Arg Tyr Lys Thr Lys 
1235 1240 1245 

His Leu Lys Val Glu Val Gin Lys He Val Pro Tyr He Gly Leu Leu 
5 1250 1255 1260 

Val Ser Ala Glu Gin Val Val Leu Asp Pro Val Arg Ser Ala Leu Lys 
1265 1270 1275 1280 

10 He Phe Gly Arg Cys Tyr Thr Ser Glu Leu Leu Tyr Ser Lys Tyr Val 

1285 1290 1295 

Glu Ala Val Arg Asp He Thr Lys Gly Trp Ser Asp Ala Arg Tyr His 
1300 1305 1310 

15 

Ser Leu Leu Cys His Met Ser Ala Cys Tyr Tyr Asn Tyr Ala Pro Glu 
1315 1320 1325 

Ser Ala Ala Tyr He He Asp Ala Val Val Arg Phe Gly Arg Gly Asp 
20 1330 1335 1340 

Phe Pro Phe Glu Gin Leu Arg Val Val Arg Ala His Val Gin Ala Pro 
1345 1350 1355 1360 

25 Asp Ala Tyr Ser Ser Thr Tyr Pro Ala Asn Val Arg Ala Ser Cys Leu 

1365 1370 1375 

Asp His Val Phe Glu Pro Arg Gin Ala Ala Ala Pro Ala Gly Phe Val 
1380 1385 1390 

30 

Ala Thr Cys Ala Lys Pro Glu Thr Pro Ser Ser Leu Thr Ala Lys Ala 
1395 1400 1405 



35 



Gly Val Ser Ala Thr Thr Ser His Val Ala Thr Gly Thr Ala Pro Pro 
1410 1415 1420 



171 



Glu Ser Pro Trp Asp Ala Pro Ala Ala Asn Ser Phe Ser Glu Leu Leu 
142 5 1430 1435 1440 



Thr Pro Glu Thr Pro Ser Thr Ser Ser Ser Pro Ser Ser Ser Ser Ser 
1445 1450 1455 



Asp Ser Ser Thr Ser Cys Gly Arg Ser Leu Ser Gly Gly Asp Thr Ala 
1460 1465 1470 



Arg Thr Thr Glu Asp Leu Asn Ser Arg Lys Pro Pro Ser Gin Asp Arg 
1475 1480 1485 



Gin Ser Arg Ser Ser Glu Cys Leu Asp Arg Ser Gly Glu Arg Thr Gly 
1490 1495 1500 

Ser Ser Leu Thr Ala Pro Thr Ala Pro Ser Pro Ser Phe Ser Phe Ser 
1505 1510 1515 152C 

Glu Arg Ala Arg Leu Ala Thr Gly Pro Thr Val Ala Ala Ala Thr Ser 
1525 1530 1535 



Pro Ser Ala Thr Pro Ser Cys Ala Thr Asp Gin Val Ala Ala Arg Thr 
1540 1545 1550 



Thr Pro Asp Phe Ala Pro Phe Leu Gly Ser Gin Ser Ala Arg Ala Val 
1555 1560 1565 



Ser Lys Pro Tyr Arg Pro Pro Thr Thr Ala Arg Trp Lys Glu Val Thr 
1570 1575 1580 



Pro Leu His Ala Trp Lys Gly Val Thr Gly Asp Arg Pro Glu Val Arg 
1585 1590 1595 1600 



Glu Asp Pro Glu Thr Ala Ala Val Val Gin Ala Leu He Ser Gly Arg 
1605 1610 1615 



172 



Tyr Pro Gin Lys Thr Lys Leu Ser Ser Asp Ala Ser Lys Gly Tyr Ser 
1620 1625 1630 



Arg Thr Lys Gly Cys Ser Gin Ser Thr Ser Phe Pro Ala Pro Ser Ala 
1635 1640 1645 



Asp Tyr Gin Ala Arg Asp Cys Gin Thr Val Arg Val Cys Arg Ala Ala 
1650 1655 1660 



Ala Glu Met Ala Arg Ser Cys lie His Glu Pro Leu Ala Ser Ser Ala 
1665 1670 1675 1680 



173 



Ala Ser Ala Asp Leu Lys Arg He Arg Ser Thr Ser Asp Ser Val Pro 
1685 1690 1695 

Asp Val Lys He Ser Lys Ser Ala 
5 1700 



(2) INFORMATION FOR SEQ ID NO: 41: 



10 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA 

( ix ) FEATURE : 
20 (A) NAME /KEY: CDS 

(B) LOCATION: 4218,-4512 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



25 



GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT ACGCGAAAGC GACAGACGTG 60 



GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 12 0 



30 AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 



CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 240 



35 



CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 
GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 



300 



360 



174 



TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GACACATTAC GGCCTTAGAG 420 



TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 480 



5 GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 540 



GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 60 0 



10 



15 



20 



25 



30 



35 



GAGAAC C AC G GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 6 60 



ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC AC GT TAT T GA AGAGCCTATG 72 0 



GC TGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 730 



TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 840 



AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 300 



CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 9 60 



CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 



CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 1030 



AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 1140 



CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 12 00 



ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 1260 



GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 



TACGTCCTCC AC AT CATC GA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 



GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 1440 



175 

GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 

CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 

5 TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 162 0 

CTTCGCGAAG CTGGCTGCTA CAACGAACGT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 168 0 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 17 4 0 

10 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 



176 



10 



20 



25 



30 



GCCC T CAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GATGTACGAG 18 60 



GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1S2 0 



AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 1980 



CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 2040 



GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 



GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 



CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 
15 GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 2280 



GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 23 4 0 



TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 2 400 



35 



CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 2 4 60 



CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2 520 



GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2 58 0 



GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 2640 



GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 27 00 



GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 27 60 



CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2 8 20 



GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2 8 80 



10 



15 



20 



25 



30 



35 
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AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2 940 



AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 



CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 



TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 



CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 318 0 



GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 32 4 0 



GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3300 



ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 



GAC GAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3 420 



GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 3 480 



GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 354 0 



TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 



GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 366 0 



ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 37 20 



AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 37 8 0 



CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 3 84 0 



CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3 900 



GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GT GAGAGAC A TCACCAAGGG CTGGAGTGAC 3 960 



178 



GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 402 0 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 40S0 

5 CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 414 0 

GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 42 00 

GCAGGTTTCG TTGCGAC ATG TGC GAA GCC GGA AAC GCC TTC TTC ACT TAC 42 50 
10 Met Cys Glu Ala Gly Asn Ala Phe Phe Thr Tyr 

15 10 



179 



CGC GAA AGC TGG TGT TTC TGC GAC TAC AAG CCA CGT TGC GAC TGG GAC 4 2 93 

Arg Glu Ser Trp Cys Phe Cys Asp Tyr Lys Pro Arg Cys Asp Trp Asp 
15 20 25 

5 TGC GCC CCC GGA GTC TCC ATG GGA TGC ACC TGC AGC CAA CAG CTT TTC 4 346 

Cys Ala Pro Gly Val Ser Met Gly Cys Thr Cys Ssr Gin Gin Leu Phe 
30 35 40 

GGA GTT ATT GAC ACC GGA GAC CCC GTC CAC ATC ATC CTC GCC GTC ATC 439 4 

10 Gly Val He Asp Thr Gly Asp Pro Val His He He Leu Ala Val He 

45 50 55 



GTC TTC ATC GGA CTC CTC TAC ATC GTG TGG AAG GTC GCT CAG TGG TGG 
Val Phe He Gly Leu Leu Tyr He Val Trp Lys Val Ala Gin Trp Trp 
60 65 70 75 



15 



20 



25 



30 



35 



AGA CAC CGC AAG GAC CAC AGA AGA CTT GAA CAG CAG AAA GCC GCC TTC 4 4 90 
Arg His Arg Lys Asp His Arg Arg Leu Glu Gin Gin Lys Ala Ala Phe 
80 85 90 

GCA AGA CAG GCA ATC ACG CTC GTC TGAATGTC TGGACAGAAG CGGAGAAAGG 4 542 
Ala Arg Gin Ala He Thr Leu Val 
95 

ACAGGCAGTT CGTTAACTGC CCCCACTGCT CCGAGCCCCT CATTCTCATT TTCGGAAAGA 46 02 

GCTCGACTGG CGACCGGGCC GAC T GTC GCC GC TGC GAC AT CACCTTCGGC AACCCCATCC 4662 

TGCGCCACGG ACCAGGTTGC CGCGAGGACC ACGCCGGACT TTGCGCCTTT CCTGGGTTCC 4722 

CAGTCTGCCC GTGCTGTCTC GAAGCCGTAC CGGCCCCCCA CGACTGCCCG TTGGAAAGAA 47 82 

GTCACCCCGC TCCACGCGTG GAAGGGCGTG ACCGGAGACC GACCGGAAGT CAGGGAGGAC 43 42 

CCGGAGACAG CGGCGGTCGT CCAGGCTCTG ATCAGCGGCC GTTATCCTCA GAAGAC GAAG 4 9 02 



180 



CTTTCCTCCG ACGCATCCAA AGGCTACTCA AGAACTAAGG GATGCTCACA ATCCACCTCT 4 9 62 



TTTCCTGCCC CGAGTGCGGA TTACCAGGCC CGCGACTGCC AGACAGTCCG AGTCTGCCGC 5022 



5 GCCGCTGCAG AGATGGCGCG CTCATGTATT CACGAGCCGT TGGCTTCATC TGCCGCCAGT 5082 



GCCGACTTGA AGCGCATACG CTCTACCTCG GACTCTGTTC CCGATGTAAA GATCAGCAAG 5142 



10 



AGCGCATGAA GGAACAAAAT TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA 52 02 



GGTAAAGACT CTGGTGAGTC CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT 52 62 



TCCCAAGCAG CAAAGGGTGC GCAACTAGTA CGGCGCCCCC TGGGATACCA 



15 



181 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

fia) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



Met Cys Glu Ala 
1 

Phe Cys Asp Tyr 
20 

Ser Met Gly Cys 
35 

Gly Asp Pro Val 
50 

Leu Tyr He Val 
65 

His Arg Arg Leu 



Gly Asn Ala Phe 
5 

Lys Pro Arg Cys 

Thr Cys Ser Gin 
40 

His He He Leu 
55 

Trp Lys Val Ala 
70 

Glu Gin Gin Lys 
85 



Phe Thr Tyr Arg 
10 

Asp Trp Asp Cys 
25 

Gin Leu Phe Gly 

Ala Val He Val 
60 

Gin Trp Trp Arg 
75 

Ala Ala Phe Ala 
90 



Glu Ser Trp Cys 
15 

Ala Pro Gly Val 
30 

Val He Asp Thr 
45 

Phe He Gly Leu 

His Arg Lys Asp 
80 

Arg Gin Ala He 
95 



Thr Leu Val 
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(2) INFORMATION FOR SEQ ID NO: 43: 

{ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5312 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



15 



(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

{A} NAME /KEY: CDS 
(B) LOCATION: 4518.. 4937 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT AC GCGAAAGC GACAGACGTG 60 
20 GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 120 

AAGTTGGACT TCGCCCCGCC ACTGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 



25 



30 



35 



CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 2 40 



CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 300 



GGGCCGTC TC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 36 



TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GACACATTAC GGCCTTAGAG 420 



TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 4 80 



GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 540 



GTTGGAATTG CCAATCACTC CCTCTATGAC GTGACCCTAG AGGAGCTGGC CAATGCGTTT 600 
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GAGAACCACG GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 66 0 



ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCCTATG 72 0 



5 GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 7 80 



TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 840 



AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 



10 



184 



CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 



CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 



5 CCTGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 108 0 



AACATCTTTT ATTACGCCGA CGCGTCGGGC ACTGAGCATA AGACCATCCT TACGTCACAG 114 0 



CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

10 

ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 12 60 



GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 



15 TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 138 0 



GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 144 0 



GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 150 0 

20 

CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 



TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 162 0 



25 CTTCGCGAAG CTGGCTGCTA CAACGAAC GT CCGGTACCTT CCACTCCTCC TGTGGAGGAG 163 0 



CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 174 0 



GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 1800 

30 

GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GAT GT AC GAG 1860 



GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 192 0 



35 AAAGCACTTT ACGTGGCACC CACCAGAGAA CTGAGAGAGG CTATGGACCG GCGGATCAAA 198 0 



185 



CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 204 0 



GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 



5 GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 



CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 222 0 



10 



GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 228 0 



GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 2340 



TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 2 400 



15 CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 2 460 



CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2 520 



20 



25 



30 



35 



GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 258 0 



GACCC GACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 2 6 40 



GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 27 00 



GAAGTGATGG CAACGATACC CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 2760 



CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 2820 



GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 28 80 



AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2 94 0 



AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 



CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 



186 



TGGACTGTCA CTGAAGACGC CCGTGAC CGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 



CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGAC AT A 



5 GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 



GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 



ATACTCGAGG AGATACTCCG TACCGGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 

10 

GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 



187 



GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 34 8 0 



GCCGCCCTTT TAGAGC GC AT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 3540 



5 TGTGGGAAAC GCACCTTGCG AGCGAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 



GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 



10 



ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 37 20 



AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 37 SO 



CATTTGAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 38 40 



15 CAGGTCGTCC TCGACCCTGT C AGGAGC GC T CTCAAGATAT TTGGGCGCTG CTACACAAGC 3900 



GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GT G AGAGAC A TCACCAAGGG CTGGAGTGAC 3960 



20 



GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA CGCGCCGGAG 4020 



TCTGCGGCGT AC AT CAT C GA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 



CAACTGCGCG TGGTGCGTGC CCATGTGCAG GCACCCGACG CTTACAGCAG CACGTATCCG 4140 



25 GCTAACGTGC GCGCATCGTG CCTTGACCA.C GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 42 0 0 



GCAGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 42 60 



30 



GGTGTTTCTG CGACTACAAG CCACGTTGCG ACTGGGACTG CGCCCCCGGA GTCTCCATGG 4 32 0 



GATGCACCTG CAGCCAACAG CTTTTCGGAG TTATTGACAC CGGAGACCCC GTCCACATCA 4 33 0 



TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 44 0 



35 GGAGACACCG CAAGGACCAC AGAAGACTTG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 



188 



CAATCACGCT CGTCTGA ATG TCT GGA CAG AAG CGG AGA AAG GAC AGG CAG 4 55 0 

Met Ser Gly Gin Lys Arg Arg Lys Asp Arg Gin 
15 10 

5 TTC GTT AAC TGC CCC CAC TGC TCC GAG CCC CTC ATT CTC ATT TTC GGA 4 596 

Phe Val Asn Cys Pro His Cys Ser Glu Pro Leu lie Leu lie Phe Gly 
15 20 25 

AAG AGC TCG ACT GGC GAC CGG GCC GAC TGT CGC CGC TGC GAC ATC ACC 4 646 

10 Lys Ser Ser Thr Gly Asp Arg Ala Asp Cys Arg Arg Cys Asp lie Thr 

30 35 40 

TTC GGC AAC CCC ATC CTG CGC CAC GGA CCA GGT TGC CGC GAG GAC CAC 4694 
Phe Gly Asn Pro He Leu Arg His Gly Pro Gly Cys Arg Glu Asp His 
15 45 50 55 

GCC GGA CTT TGC GCC TTT CCT GGG TTC CCA GTC TGC CCG TGC TGT CTC 47 42 

Ala Gly Leu Cys Ala Phe Pro Gly Phe Pro Val Cys Pro Cys Cys Leu 
60 65 70 75 

20 

GAA GCC GTA CCG GCC CCC CAC GAC TGC CCG TTG GAA AGA AGT CAC CCC 47 90 

Glu Ala Val Pro Ala Pro His Asp Cys Pro Leu Glu Arg Ser His Pro 
80 85 50 

25 GCT CCA CGC GTG GAA GGG CGT GAC CGG AGA CCG ACC GGA AGT CAG GGA 4 8 38 

Ala Pro Arg Val Glu Gly Arg Asp Arg Arg Pro Thr Gly Ser Gin Gly 
95 100 105 

GGA CCC GGA GAC AGC GGC GGT CGT CCA GGC TCT GAT CAG CGG CCG TTA 4886 
30 Gly Pro Gly Asp Ser Gly Gly Arg Pro Gly Ser Asp Gin Arg Pro Leu 

110 115 120 

TCC TCA GAA GAC GAA GCT TTC CTC CGA CGC ATC CAA AGG CTA CTC AAG 4 934 

Ser Ser Glu Asp Glu Ala Phe Leu Arg Arg lie Gin Arg Leu Leu Lys 
35 125 130 135 



189 



AAC TAAGGGATGC TC ACAATC CA CCTCTTTTCC TGCCCCGAGT GCGGATTACC 4 937 

Asn 

140 

5 AGGCCCGCGA CTGCCAGACA GTCCGAGTCT GCCGCGCCGC TGCAGAGATG GCGCGCTCAT 5047 

GTATTCACGA GCCGTTGGCT TCATCTGCCG CCAGTGCCGA CTTGAAGCGC ATACGCTCTA 5107 
CCTCGGACTC TGTTCCCGAT GTAAAGATCA GCAAGAGCGC ATGAAGGAAC AAAATTAGTT 5167 

10 

TCCTTGTTCG TAAACAAGGT GGTCCCTCCC ATTGAGGTAA AGACTCTGGT GAGTCCTCAA 5227 



190 



15 



CGTTACTCGT TGAGTCTGCT GCGGTTCGAT TCCATTCCCA AGCAGCAAAG GGTGCGCAAC 5287 



TAGTACGGCG CCCCCTGGGA TACCA 



(2) INFORMATION FOR SEQ ID NO: 44: 

{i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii} MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Met Ser Gly Gin Lys Arg Arg Lys Asp Arg Gin Phe Val Asn Cys Pro 
20 1 5 10 15 

His Cys Ser Glu Pro Leu He Leu He Phe Gly Lys Ser Ser Thr Gly 
20 25 30 

25 Asp Arg Ala Asp Cys Arg Arg Cys Asp He Thr Phe Gly Asn Pro He 

35 40 45 

Leu Arg His Gly Pro Gly Cys Arg Glu Asp His Ala Gly Leu Cys Ala 
50 55 60 

30 

Phe Pro Gly Phe Pro Val Cys Pro Cys Cys Leu Glu Ala Val Pro Ala 
65 70 75 80 

Pro His Asp Cys Pro Leu Glu Arg Ser His Pro Ala Pro Arg Val Glu 
35 S5 90 95 



191 



Gly Arg Asp Arg Arg Pro Thr Gly Ser Gin Gly Gly Pro Gly Asp Ser 
100 105 110 

Gly Gly Arg Pro Gly Ser Asp Gin Arg Pro Leu Ser Ser Glu Asp Glu 
5 115 120 125 

Ala Phe Leu Arg Arg He Gin Arg Leu Leu Lys Asn 
130 135 140 



10 



(2) INFORMATION FOR SEQ ID NO: 45: 



15 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 5312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D} TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: DNA 



25 



(IX) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 4944.. 5162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 45: 



30 



GTTCTGCCTC CCCCGGACGG TAAATATAGG GGAACAATGT AC GC GAAAGC GACAGACGTG 60 



GCGCGTGTCT ACGCCGCGGC AGATGTCGCC TACGCGAACG TACTGCAGCA GAGAGCAGTC 120 



AAGTTGGACT TCGCCCCGCC AC TGAAGGCA CTAGAAACCC TCCACAGACT GTACTATCCG 180 



35 



CTGCGCTTCA AAGGGGGCAC TTTACCCCCG ACACAACACC CGATCCTGGC CGGGCACCAA 



240 



192 

CGTGTCGCAG AAGAGGTTCT GCACAATTTC GCCAGGGGAC GTAGCACAGT GCTCGAGATA 
GGGCCGTCTC TGCACAGCGC ACTTAAGCTA CATGGGGCAC CGAACGCCCC CGTCGCAGAC 
TATCACGGGT GCACCAAGTA CGGCACCCGC GACGGCTCGC GACACATTAC GGCCTTAGAG 
TCTAGATCCG TCGCCACAGG CCGGCCCGAG TTCAAGGCCG ACGCCTCACT GCTCGCCAAC 
GGCATTGCCT CCCGCACCTT CTGCGTCGAC GGAGTCGGCT CTTGCGCGTT CAAATCGCGC 
GTTGGAATTG CCAATCACTC CCTCTATGAC GT GACCC TAG AGGAGCTGGC CAATGCGTTT 



193 

GAGAACC AC G GACTTCACAT GGTCCGCGCG TTCATGCACA TGCCAGAAGA GCTGCTCTAC 660 

ATGGACAACG TGGTTAATGC CGAGCTCGGC TACCGCTTCC ACGTTATTGA AGAGCC TAT G 720 

5 GCTGTGAAGG ACTGCGCATT CCAGGGGGGG GACCTCCGTC TCCACTTCCC TGAGTTGGAC 78 0 

TTCATCAACG AGAGCCAAGA GCGGCGCATC GAGAGGCTGG CCGCCCGCGG CTCCTACTCC 6 40 

AGACGCGCCG TCATTTTCTC CGGCGACGAC GACTGGGGTG ATGCGTACTT ACACGACTTC 900 

CACACATGGC TCGCCTACCT ACTGGTGAGG AACTACCCCA CTCCGTTTGG TTTCTCACTC 960 

CATATAGAAG TCCAGAGGCG CCACGGCTCC AGCATTGAGC TGCGCATCAC TCGCGCGCCA 1020 

15 CC TGGAGACC GCATGCTGGC CGTCGTCCCA AGGACGTCCC AAGGCCTCTG CAGAATCCCA 108 0 

AACATCTTTT ATTACGCCGA CGCGTCGGGC AC T GAGC AT A AGACCATCCT TACGTCACAG 1140 



10 



20 



CACAAAGTCA ACATGCTGCT CAATTTTATG CAAACGCGTC CTGAGAAGGA ACTAGTCGAC 1200 

ATGACCGTCT TGATGTCGTT CGCGCGCGCT AGGCTGCGCG CGATCGTGGT CGCCTCAGAA 12 60 

GTCACCGAGA GCTCCTGGAA CATCTCACCG GCTGACCTGG TCCGCACTGT CGTGTCTCTT 1320 

25 TACGTCCTCC ACATCATCGA GCGCCGAAGG GCTGCGGTCG CTGTCAAGAC CGCCAAGGAC 1380 

GACGTCTTTG GAGAGACTTC GTTCTGGGAG AGTCTCAAGC ACGTCTTGGG CTCCTGTTGC 14 40 



30 



GGTCTGCGCA ACCTCAAAGG CACCGACGTC GTCTTTACTA AGCGCGTCGT CGATAAGTAC 1500 



CGAGTCCACT CGCTCGGAGA CATAATCTGC GACGTCCGCC TGTCCCCTGA ACAGGTCGGC 1560 



TTCCTGCCGT CCCGCGTACC ACCTGCCCGC GTCTTTCACG ACAGGGAAGA GCTTGAGGTC 162 0 



35 CTTCGCGAAG CTGGCTGCTA CAACGAACGT CC GGTACCTT CCACTCCTCC TGTGGAGGAG 



1680 



10 



194 

CCCCAAGGTT TCGACGCCGA CTTGTGGCAC GCGACCGCGG CCTCACTCCC CGAGTACCGC 17 4 0 

GCCACCTTGC AGGCAGGTCT CAACACCGAC GTCAAGCAGC TCAAGATCAC CCTCGAGAAC 18 0 0 

5 GCCCTCAAGA CCATCGACGG GCTCACCCTC TCCCCAGTCA GAGGCCTCGA GAT GT AC GAG 1860 

GGCCCGCCAG GCAGCGGCAA GACGGGCACC CTCATCGCCG CCCTTGAGGC CGCGGGCGGT 1920 

AAAGCACTTT ACGTGGCACC C AC C AGAGAA CTGAGAGAGG CTATGGACCG GC GGATC AAA 193 0 

CCGCCGTCCG CCTCGGCTAC GCAACATGTC GCCCTTGCGA TTCTCCGTCG TGCCACCGCC 2 04 0 

GAGGGCGCCC CTTTCGCTAC CGTGGTTATC GACGAGTGCT TCATGTTCCC GCTCGTGTAC 2100 

15 GTCGCGATCG TGCACGCCTT GTCCCCGAGC TCACGAATAG TCCTTGTAGG GGACGTCCAC 2160 

CAAATCGGGT TTATAGACTT CCAAGGCACA AGCGCGAACA TGCCGCTCGT TCGCGACGTC 2220 



GTTAAGCAGT GCCGTCGGCG CACTTTCAAC CAAACCAAGC GCTGTCCGGC CGACGTCGTT 22 8 0 

20 

GCCACCACGT TTTTCCAGAG CTTGTACCCC GGGTGCACAA CCACCTCAGG GTGCGTCGCA 234 0 



TCCATCAGCC ACGTCGCCCC AGACTACCGC AACAGCCAGG CGCAAACGCT CTGCTTCACG 2 4 00 



25 CAGGAGGAAA AGTCGCGCCA CGGGGCTGAG GGCGCGATGA CTGTGCACGA AGCGCAGGGA 24 60 



CGCACTTTTG CGTCTGTCAT TCTGCATTAC AACGGCTCCA CAGCAGAGCA GAAGCTCCTC 2 520 



GCTGAGAAGT CGCACCTTCT AGTCGGCATC ACGCGCCACA CCAACCACCT GTACATCCGC 2580 

30 

GACCCGACAG GTGACATTGA GAGACAACTC AACCATAGCG CGAAAGCCGA GGTGTTTACA 2640 



GACATCCCTG CACCCCTGGA GATCACGACT GTCAAACCGA GTGAAGAGGT GCAGCGCAAC 2700 



35 GAAGTGATGG C AAC GAT AC C CCCGCAGAGT GCCACGCCGC ACGGAGCAAT CCATCTGCTC 2760 
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CGCAAGAACT TCGGGGACCA ACCCGACTGT GGCTGTGTCG CTTTGGCGAA GACCGGCTAC 28 20 

GAGGTGTTTG GCGGTCGTGC CAAAATCAAC GTAGAGCTTG CCGAACCCGA CGCGACCCCG 2880 

5 AAGCCGCATA GGGCGTTCCA GGAAGGGGTA CAGTGGGTCA AGGTCACCAA CGCGTCTAAC 2 940 

AAACACCAGG CGCTCCAGAC GCTGTTGTCC CGCTACACCA AGCGAAGCGC TGACCTGCCG 3000 



10 



CTACACGAAG CTAAGGAGGA CGTCAAACGC ATGCTAAACT CGCTTGACCG ACATTGGGAC 3060 



TGGACTGTCA CTGAAGACGC CCGTGACCGA GCTGTCTTCG AGACCCAGCT CAAGTTCACC 3120 



196 



CAACGCGGCG GCACCGTCGA AGACCTGCTG GAGCCAGACG ACCCCTACAT CCGTGACATA 318 0 



GACTTCCTTA TGAAGACTCA GCAGAAAGTG TCGCCCAAGC CGATCAATAC GGGCAAGGTC 3240 



5 GGGCAGGGGA TCGCCGCTCA CTCAAAGTCT CTCAACTTCG TCCTCGCCGC TTGGATACGC 3 300 



ATACTCGAGG AGATACTCCG TACC GGGAGC CGCACGGTCC GGTACAGCAA CGGTCTCCCC 3360 



GACGAAGAAG AGGCCATGCT GCTCGAAGCG AAGATCAATC AAGTCCCACA CGCCACGTTC 3420 

10 

GTCTCGGCGG ACTGGACCGA GTTTGACACC GCCCACAATA ACACGAGTGA GCTGCTCTTC 34 80 



GCCGCCCTTT TAGAGCGCAT CGGCACGCCT GCAGCTGCCG TTAATCTATT CAGAGAACGG 354 0 



15 TGTGGGAAAC GCACCTTGCG AGC GAAGGGT CTAGGCTCCG TTGAAGTCGA CGGTCTGCTC 3600 



GACTCCGGCG CAGCTTGGAC GCCTTGCCGC AACACCATCT TCTCTGCCGC CGTCATGCTC 3660 



ACGCTCTTCC GCGGCGTCAA GTTCGCAGCT TTCAAAGGCG ACGACTCGCT CCTCTGTGGT 37 2 0 

20 

AGCCATTACC TCCGTTTCGA CGCTAGCCGC CTTCACATGG GCGAACGTTA CAAGACCAAA 37 80 



CAT T T GAAGG TCGAGGTGCA GAAAATCGTG CCGTACATCG GACTCCTCGT CTCCGCTGAG 38 4 0 



25 CAGGTCGTCC TCGACCCTGT CAGGAGCGCT CTCAAGATAT TTGGGCGCTG CTACACAAGC 3 900 



GAACTCCTTT ACTCCAAGTA CGTGGAGGCT GT GAGAGAC A TCACCAAGGG CTGGAGTGAC 3960 



GCCCGCTACC ACAGCCTCCT GTGCCACATG TCAGCATGCT ACTACAATTA C GCGC CGGAG 4 02 0 

30 

TCTGCGGCGT ACATCATCGA CGCTGTTGTT CGCTTTGGGC GCGGCGACTT CCCGTTTGAA 4 080 



CAACTGCGCG TGGTGCGTGC CC AT GT GC AG GCACCCGACG CTTACAGCAG CACGTATCCG 414 0 



35 GCTAACGTGC GCGCATCGTG CCTTGACCAC GTCTTCGAGC CCCGCCAGGC CGCCGCCCCG 4200 



197 



GCAGGTTTCG TTGCGACATG TGCGAAGCCG GAAACGCCTT CTTCACTTAC CGCGAAAGCT 42 60 



GGTGTTTCTG CGACTACAAG CCACGTTGCG AC T GGGAC T G CGCCCCCGGA GTCTCCATGG 4 320 



5 GATGCACCTG CAGCCAACAG CTTTTCGGAG TTATTGACAC CGGAGACCCC GTCCACATCA 4 38 0 



TCCTCGCCGT CATCGTCTTC ATCGGACTCC TCTACATCGT GTGGAAGGTC GCTCAGTGGT 4 440 



10 



GGAGACACCG CAAGGACCAC AGAAGACTTG AACAGCAGAA AGCCGCCTTC GCAAGACAGG 4 500 



CAATCACGCT CGTCTGAATG TCTGGACAGA AGCGGAGAAA GGACAGGCAG TTCGTTAACT 4 560 



GCCCCCACTG CTCCGAGCCC CTCATTCTCA TTTTCGGAAA GAGCTCGACT GGCGACCGGG 4 620 
15 CCGACTGTCG CCGCTGCGAC ATCACCTTCG GCAACCCCAT CCTGCGCCAC GGACCAGGTT 4 6S0 



GCCGCGAGGA CCACGCCGGA CTTTGCGCCT TTCCTGGGTT CCCAGTCTGC CCGTGCTGTC 4740 

TCGAAGCCGT ACCGGCCCCC CACGACTGCC CGTTGGAAAG AAGTCACCCC GCTCCACGCG 4 30 0 

20 

TGGAAGGGCG TGACCGGAGA CCGACCGGAA GTCAGGGAGG ACCCGGAGAC AGCGGCGGTC 4 3 60 

GTCCAGGCTC TGATCAGCGG CCGTTATCCT CAGAAGACGA AGCTTTCCTC CGACGCATCC 4 92 0 

25 AAAGGCTACT CAAGAACTAA GGG ATG CTC ACA ATC CAC CTC TTT TCC TGC 4 97 0 

Met Leu Thr lie His Leu Phe Ser Cys 
1 5 

CCC GAG TGC GGA TTA CCA GGC CCG CGA CTG CCA GAC AGT CCG AGT CTG 5018 
30 Pro Glu Cys Gly Leu Pro Gly Pro Arg Leu Pro Asp Ser Pro Ser Leu 

10 15 20 25 

CCG CGC CGC TGC AGA GAT GGC GCG CTC ATG TAT TCA CGA GCC GTT GGC 5066 
Pro Arg Arg Cys Arg Asp Gly Ala Leu Met Tyr Ser Arg Ala Val Gly 
35 30 35 40 



198 



TTC ATC TGC CGC CAG TGC CGA CTT GAA GCG CAT ACG CTC TAC CTC GGA 
Phe He Cys Arg Gin Cys Arg Leu Glu Ala His Thr Leu Tyr Leu Gly 
45 50 55 



CTC TGT TCC CGA TGT AAA GAT CAG CAA GAG CGC ATG AAG GAA CAA AAT 
Leu Cys Ser Arg Cys Lys Asp Gin Gin Glu Arg Met Lys Glu Gin Asn 
60 65 70 



5162 



10 



15 



TAGTTTCCTT GTTCGTAAAC AAGGTGGTCC CTCCCATTGA GGTAAAGACT CTGGTGAGTC 5222 



CTCAACGTTA CTCGTTGAGT CTGCTGCGGT TCGATTCCAT TCCCAAGCAG CAAAGGGTGC 5282 



GCAACTAGTA CGGCGCCCCC TGGGATACCA 



(2) INFORMATION FOR SEQ ID NO: 46: 



20 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



25 



(11) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Met Leu Thr He His Leu Phe Ser Cys Pro Glu Cys Gly Leu Pro Gly 
30 1 5 10 15 

Pro Arg Leu Pro Asp Ser Pro Ser Leu Pro Arg Arg Cys Arg Asp Gly 
20 25 30 



35 Ala Leu Met Tyr Ser Arg Ala Val Gly Phe He Cys Arg Gin Cys Arg 

35 40 45 



199 



Leu Glu Ala His Thr Leu Tyr Leu Gly Leu Cys Ser Arg Cys Lys Asp 
50 55 60 

Gin Gin Glu Arg Met Lys Glu Gin Asn 
5 65 70 



10 



(2) INFORMATION FOR SEQ ID NO: 47: 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) S TRANDE DNE S S : single 
15 (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(ix) FEATURE: 
20 (A) NAME/KEY: CDS 

(B) LOCATION: 283.. 753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

25 

GTTTTTCTTT CTTTACCAAG TGTGGTAAAA T T TAAAC AAA GAAGAAAACC AGGACCGTAA 60 

CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 

30 GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 

GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 2 40 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 2 94 

35 Met Ser Glu His 



200 



ACC ATC GCC CAC TCC ATC ACA TTA CCA CCC GGT TAC ACC CTT GCC CTA 34 2 

Thr lie Ala His Ser He Thr Leu Pro Pro Gly Tyr Thr Leu Ala Leu 
5 10 15 20 

5 ATA CCC CCT GAA CCT GAA GCA GGA TGG GAG ATG CTG GAG TGG CGT CAC 3 90 

He Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu Glu Trp Arg His 
25 30 35 

AGC GAC CTC ACA ACC GTC GCG GAA CCC GTA ACG TTC GGG TCA GCG CCA 433 
10 Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe Gly Ser Ala Pro 

40 45 50 

ACA CCG TCA CCG TCA ATG GTA GAA GAA ACC AAC GGC GTC GGA CCG GAA 486 
Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly Val Gly Pro Glu 
15 55 60 65 



201 



GGC AAG TTT CTC CCC CTG ACA ATT TCA CCG CTG CTG CAC AAG ACC TCG 534 
Gly Lys Phe Leu Pro Leu Thr lie Ser Pro Leu Leu His Lys Thr Ser 
70 75 80 

5 

CGC AAA GCC TTG ACG CCA ACA CCG TCA CTT TCC CCG CTA ACA TCT CTA 5 82 

Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro Leu Thr Ser Leu 
35 90 95 100 



10 GCA TGC CCG AAT TCC GGA ATT GGG CCA AGG GAA AGA TCG ACC TCG ACT 630 

Ala Cys Pro Asn Ser Gly He Gly Pro Arg Glu Arg Ser Thr Ser Thr 
105 110 115 



CCG ATT CCA TCG GCT GGT ACT TCA AGT ACC TTG ACC CAG CGG GTG CTA 678 
15 Pro He Pro Ser Ala Gly Thr Ser Ser Thr Leu Thr Gin Arg Val Leu 

120 125 130 

CAG AGT CTG CGC GCG CCG TCG GCG AGT ACT CGA AGA TCC CTG ACG GCC 726 
Gin Ser Leu Arg Ala Pro Ser Ala Ser Thr Arg Arg Ser Leu Thr Ala 
20 135 140 145 

TCG TCA AGT TCT CCG TCG ACG CAG AGA TAAGAGAGAT CTATAAC GAG 77 3 
Ser Ser Ser Ser Pro Ser Thr Gin Arg 
150 155 

25 

GAGTGCCCCG TCGTCACTGA CGTGTCCGTC CCCCTCGACG GCCGCCAGTG GAGCCTCTCG 83 3 

ATTTTCTCCT TTCCGATGTT CAGAACCGCC TACGTCGCCG TAGCGAACGT C GAGAAC AAG 8 93 

30 GAGATGTCGC TCGACGTTGT CAACGACCTC ATCGAGTGGC TCAACAATCT CGCCGACTGG 953 

CGTTATGTCG TTGACTCTGA ACAGTGGATT AACTTCACCA AT GAC ACC AC GTACTACGTC 1013 

CGCATCCGCG TTCTACGTCC AACCTACGAC GTTCCAGACC CCACAGAGGG CCTTGTTCGC 107 3 



35 



ACAGTCTCAG ACTACCGCCT CACTTATAAG GCGATAACAT GTGAAGCCAA CATGCCAACA 1133 



202 



CTCGTCGACC AAGGCTTTTG GATCGGCGGC CAGTACGCTC TCACCCCGAC TAGCCTACCG 1193 



CAGTACGACG TCAGCGAGGC CTACGCTCTG CACACTTTGA CCTTCGCCAG ACCATCCAGC 12 5 3 



5 GCCGCTGCAC TCGCGTTTGT GTGGGCAGGT TTGCCACAGG GTGGCACTGC GCCTGCAGGC 1313 



ACTCCAGCCT GGGAGCAGGC ATCCTCGGGT GGCTACCTCA CCTGGCGCCA CAACGGTACT 137 3 



10 



20 



30 



ACTTTCCCAG CTGGCTCCGT TAGCTACGTT CTCCCTGAGG GTTTCGCCCT TGAGCGCTAC 14 3 3 



GACCCGAACG ACGGCTCTTG GACCGACTTC GCTTCCGCAG GAGACACCGT CACTTTCCGG 14 93 



CAGGTCGCCG TCGACGAGGT CGTTGTGACC AACAACCCCG CCGGCGGCGG CAGCGCCCCC 1553 

/ 

15 ACCTTCACCG TGAGAGTGCC CCCTTCAAAC GCTTACACCA ACACCGTGTT TAGGAACACG 1613 



CTCTTAGAGA CTCGACCCTC CTCTCGTAGG CTCGAACTCC CTATGCCACC TGCTGACTTT 167 3 



GGACAGACGG TCGCCAACAA CCC GAAGATC GAGCAGTCGC TTCTTAAAGA AACACTTGGC 17 33 



TGCTATTTGG TCCACTCCAA AATGCGAAAC CCCGTTTTCC AGCTCACGCC AGCCAGCTCC 17 93 



TTTGGCGCCG TTTCCTTCAA CAATCC GGGT TATGAGCGCA CACGCGACCT CCCGGACTAC 18 53 



25 ACTGGCATCC GTGACTCATT CGACCAGAAC ATGTCCACCG CTGTGGCCCA CTTCCGCTCA 1913 



CTCTCCCACT CCTGCAGTAT CGTCACTAAG ACCTACCAGG GTTGGGAAGG CGTCACGAAC 197 3 



GTCAACACGC CTTTCGGCCA ATTCGCGCAC GCGGGCCTCC T CAAGAAT GA GGAGATCCTC 2 033 



TGCCTCGCCG ACGACCTGGC CACCCGTCTC ACAGGTGTCT ACCCCGCCAC TGACAACTTC 2 0 93 



GCGGCCGCCG TTTCTGCCTT CGCCGCGAAC ATGCTGTCCT CCGTGCTGAA GTCGGAGGCA 2153 



35 ACGTCCTCCA TCATCAAGTC CGTTGGCGAG ACTGCCGTCG GCGCGGCTCA GTCCGGCCTC 2213 
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GCGAAGCTAC CCGGACTGCT AATGAGTGTA CCAGGGAAGA TTGCCGCGCG TGTCCGCGCG 22 7 3 



CGCCGAGCGC GCCGCCGCGC CGCTCGTGCC AATTAGTTTG CTCGCTCCTG TTTCGCCGTT 2 3 33 



TCGTAAAACG GCGTGGTCCC GCACATTACG CGTACCCTAA AGACTCTGGT GAGTCCCCGT 2 393 



C GT T AC AC GA CGGGTCTGCC GCGGTTCGAT TCCATTCCCA AGC GGCAAGA AGGACGTAGT 24 53 



TAGCTCTGCG TCCCTCGGGA TACCA 

10 



204 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Met Ser Glu His 
1 

Thr Leu Ala Leu 

20 

Glu Trp Arg His 
35 

Gly Ser Ala Pro 
50 

Val Gly Pro Glu 
65 

His Lys Thr Ser 

Leu Thr Ser Leu 
100 

Ser Thr Ser Thr 



Thr He Ala His 
5 

He Pro Pro Glu 

Ser Asp Leu Thr 
40 

Thr Pro Ser Pro 

55 

Gly Lys Phe Leu 

70 

Arg Lys Ala Leu 
85 

Ala Cys Pro Asn 
Pro He Pro Ser 



Ser He Thr Leu 
10 

Pro Glu Ala Gly 
25 

Thr Val Ala Glu 

Ser Met Val Glu 
60 

Pro Leu Thr He 
75 

Thr Pro Thr Pro 
90 

Ser Gly He Gly 
105 

Ala Gly Thr Ser 



Pro Pro Gly Tyr 

15 

Trp Glu Met Leu 
30 

Pro Val Thr Phe 
45 

Glu Thr Asn Gly 

Ser Pro Leu Leu 
80 

Ser Leu Ser Pro 
95 

Pro Arg Glu Arg 
110 

Ser Thr Leu Thr 
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5 



10 



15 



20 



25 



30 



35 



Gin Arg Val Leu Gin Ser Leu Arg Ala Pro Ser Ala Ser Thr Arg Arg 
130 135 140 

Ser Leu Thr Ala Ser Ser Ser Ser Pro Ser Thr Gin Arg 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(IX) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 366.. 2306 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTTTTTCTTT CTTTACCAAG TGT GGTAAAA TT TAAAC AAA GAAGAAAACC AGGACCGTAA 60 

CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 12 0 

GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 

GTCGACAGAC GCTTCCGGAC CACTAGAACC TCCTCGAGCG ACGCACACAC AGCACACACA 240 

CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TTATGAGCGA GCACACCATC 3 00 



206 



10 



GCCCACTCCA TCACATTACC ACCCGGTTAC ACCCTTGCCC TAATACCCCC TGAACCTGAA 

GCAGG ATG GGA GAT GCT GGA GTG GCG TCA CAG CGA CCT CAC AAC CGT 
Met Gly Asp Ala Gly Val Ala Ser Gin Arg Pro His Asn Arg 
15 10 

CGC GGA ACC CGT AAC GTT CGG GTC AGC GCC AAC ACC GTC ACC GTC AAT 
Arg Gly Thr Arg Asn Val Arg Val Ser Ala Asn Thr Val Thr Val Asn 
15 20 25 30 

GGT AGA AGA AAC CAA CGG CGT CGG ACC GGA AGG CAA GTT TCT CCC CCT 
Gly Arg Arg Asn Gin Arg Arg Arg Thr Gly Arg Gin Val Ser Pro Pro 
35 40 45 

GAC AAT TTC ACC GCT GCT GCA CAA GAC CTC GCG CAA AGC CTT GAC GCC 
Asp Asn Phe Thr Ala Ala Ala Gin Asp Leu Ala Gin Ser Leu Asp Ala 
50 55 60 



AAC ACC GTC ACT TTC CCC GCT AAC ATC TCT AGC ATG CCC GAA TTC CGG 
20 Asn Thr Val Thr Phe Pro Ala Asn lie Ser Ser Met Pro Glu Phe Arg 

65 70 75 



15 



207 



AAT TGG GCC AAG GGA AAG ATC GAC CTC GAC TCC GAT TCC ATC GGC TGG 64 7 

Asn Trp Ala Lys Gly Lys He Asp Leu Asp Ser Asp Ser He Gly Trp 
80 85 90 

5 

TAC TTC AAG TAG CTT GAC CCA GCG GGT GCT ACA GAG TCT GCG CGC GCC 695 

Tyr Phe Lys Tyr Leu Asp Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala 
95 100 105 110 

10 GTC GGC GAG TAC TCG AAG ATC CCT GAC GGC CTC GTC AAG TTC TCC GTC 7 43 

Val Gly Glu Tyr Ser Lys He Pro Asp Gly Leu Val Lys Phe Ser Val 
115 120 125 

GAC GCA GAG ATA AGA GAG ATC TAT AAC GAG GAG TGC CCC GTC GTC ACT 7 91 

15 Asp Ala Glu He Arg Glu He Tyr Asn Glu Glu Cys Pro Val Val Thr 

130 135 140 

GAC GTG TCC GTC CCC CTC GAC GGC CGC CAG TGG AGC CTC TCG ATT TTC 839 
Asp Val Ser Val Pro Leu Asp Gly Arg Gin Trp Ser Leu Ser He Phe 
20 145 150 155 

TCC TTT CCG ATG TTC AGA ACC GCC TAC GTC GCC GTA GCG AAC GTC GAG 887 
Ser Phe Pro Met Phe Arg Thr Ala Tyr Val Ala Val Ala Asn Val Glu 
160 165 170 

AAC AAG GAG ATG TCG CTC GAC GTT GTC AAC GAC CTC ATC GAG TGG CTC 935 
Asn Lys Glu Met Ser Leu Asp Val Val Asn Asp Leu He Glu Trp Leu 
175 160 185 190 

30 AAC AAT CTC GCC GAC TGG CGT TAT GTC GTT GAC TCT GAA CAG TGG ATT 98 3 

Asn Asn Leu Ala Asp Trp Arg Tyr Val Val Asp Ser Glu Gin Trp He 
195 200 205 

AAC TTC ACC AAT GAC ACC ACG TAC TAC GTC CGC ATC CGC GTT CTA CGT 1031 
35 Asn Phe Thr Asn Asp Thr Thr Tyr Tyr Val Arg He Arg Val Leu Arg 

210 215 220 



25 
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20 



25 



30 



35 



CCA ACC TAC GAC GTT CCA GAC CCC AC A GAG GGC CTT GTT CGC AC A GTC 107 9 

Pro Thr Tyr Asp Val Pro Asp Pro Thr Glu Gly Leu Val Arg Thr Val 
225 230 235 

TCA GAC TAC CGC CTC ACT TAT AAG GCG ATA ACA TGT GAA GCC AAC ATG 1127 
Ser Asp Tyr Arg Leu Thr Tyr Lys Ala He Thr Cys Glu Ala Asn Met 
240 245 250 

CCA ACA CTC GTC GAC CAA GGC TTT TGG ATC GGC GGC CAG TAC GCT CTC 117 5 

Pro Thr Leu Val Asp Gin Gly Phe Trp He Gly Gly Gin Tyr Ala Leu 
255 260 265 270 

ACC CCG ACT AGC CTA CCG CAG TAC GAC GTC AGC GAG GCC TAC GCT CTG 122 3 

Thr Pro Thr Ser Leu Pro Gin Tyr Asp Val Ser Glu Ala Tyr Ala Leu 
275 280 285 

CAC ACT TTG ACC TTC GCC AGA CCA TCC AGC GCC GCT GCA CTC GCG TTT 1271 
His Thr Leu Thr Phe Ala Arg Pro Ser Ser Ala Ala Ala Leu Ala Phe 
290 295 300 

GTG TGG GCA GGT TTG CCA CAG GGT GGC ACT GCG CCT GCA GGC ACT CCA 1319 
Val Trp Ala Gly Leu Pro Gin Gly Gly Thr Ala Pro Ala Gly Thr Pro 
305 310 315 

GCC TGG GAG CAG GCA TCC TCG GGT GGC TAC CTC ACC TGG CGC CAC AAC 136 7 

Ala Trp Glu Gin Ala Ser Ser Gly Gly Tyr Leu Thr Trp Arg His Asn 
320 325 330 

GGT ACT ACT TTC CCA GCT GGC TCC GTT AGC TAC GTT CTC CCT GAG GGT 1415 
Gly Thr Thr Phe Pro Ala Gly Ser Val Ser Tyr Val Leu Pro Glu Gly 
335 340 345 350 

TTC GCC CTT GAG CGC TAC GAC CCG AAC GAC GGC TCT TGG ACC GAC TTC 14 63 

Phe Ala Leu Glu Arg Tyr Asp Pro Asn Asp Gly Ser Trp Thr Asp Phe 
355 360 365 
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15 



GCT TCC GCA GGA GAC ACC GTC ACT TTC CGG CAG GTC GCC GTC GAC GAG 1511 
Ala Ser Ala Gly Asp Thr Val Thr Phe Arg Gin Val Ala Val Asp Glu 
370 375 380 

GTC GTT GTG ACC AAC AAC CCC GCC GGC GGC GGC AGC GCC CCC ACC TTC 1559 
Val Val Val Thr Asn Asn Pro Ala Gly Gly Gly Ser Ala Pro Thr Phe 
385 390 395 

ACC GTG AGA GTG CCC CCT TCA AAC GCT TAC ACC AAC ACC GTG TTT AGG 160 7 

Thr Val Arg Val Pro Pro Ser Asn Ala Tyr Thr Asn Thr Val Phe Arg 
400 405 410 

AAC ACG CTC TTA GAG ACT CGA CCC TCC TCT CGT AGG CTC GAA CTC CCT 1655 
Asn Thr Leu Leu Glu Thr Arg Pro Ser Ser Arg Arg Leu Glu Leu Pro 
415 420 425 430 

ATG CCA CCT GCT GAC TTT GGA CAG ACG GTC GCC AAC AAC CCG AAG ATC 17 03 

Met Pro Pro Ala Asp Phe Gly Gin Thr Val Ala Asn Asn Pro Lys lie 
435 440 445 

GAG CAG TCG CTT CTT AAA GAA ACA CTT GGC TGC TAT TTG GTC CAC TCC 17 51 

Glu Gin Ser Leu Leu Lys Glu Thr Leu Gly Cys Tyr Leu Val His Ser 
450 455 460 

AAA ATG CGA AAC CCC GTT TTC CAG CTC ACG CCA GCC AGC TCC TTT GGC 17 9 9 

Lys Met Arg Asn Pro Val Phe Gin Leu Thr Pro Ala Ser Ser Phe Gly 
465 470 475 



GCC GTT TCC TTC AAC AAT CCG GGT TAT GAG CGC ACA CGC GAC CTC CCG 18 47 

30 Ala Val Ser Phe Asn Asn Pro Gly Tyr Glu Arg Thr Arg Asp Leu Pro 

480 435 490 



GAC TAC ACT GGC ATC CGT GAC TCA TTC GAC CAG AAC ATG TCC ACC GCT 
Asp Tyr Thr Gly He Arg Asp Ser Phe Asp Gin Asn Met Ser Thr Ala 
495 500 505 510 



20 



25 



35 



210 



GTG GCC CAC 
Val Ala His 

5 ACC TAC CAG 

Thr Tyr Gin 

CAA TTC GCG 
10 Gin Phe Ala 

545 

GCC GAC GAC 
Ala Asp Asp 
15 560 

AAC TTC GCG 
Asn Phe Ala 
575 

20 

GTG CTG AAG 
Val Leu Lys 

25 ACT GCC GTC 

Thr Ala Val 

CTA ATG AGT 
30 Leu Met Ser 

625 



TTC CGC TCA CTC TCC 
Phe Arg Ser Leu Ser 
515 

GGT TGG GAA GGC GTC 
Gly Trp Glu Gly Val 
530 

CAC GCG GGC CTC CTC 
His Ala Gly Leu Leu 
550 

CTG GCC ACC CGT CTC 
Leu Ala Thr Arg Leu 
565 

GCC GCC GTT TCT GCC 
Ala Ala Val Ser Ala 
580 

TCG GAG GCA ACG TCC 
Ser Glu Ala Thr Ser 
595 

GGC GCG GCT CAG TCC 
Gly Ala Ala Gin Ser 
610 

GTA CCA GGG AAG ATT 
Val Pro Gly Lys He 
630 



CAC TCC TGC AGT ATC 
His Ser Cys Ser He 
520 

ACG AAC GTC AAC ACG 
Thr Asn Val Asn Thr 
535 

AAG AAT GAG GAG ATC 
Lys Asn Glu Glu He 
555 

ACA GGT GTC TAC CCC 
Thr Gly Val Tyr Pro 
570 

TTC GCC GCG AAC ATG 
Phe Ala Ala Asn Met 
535 

TCC ATC ATC AAG TCC 
Ser He He Lys Ser 
600 

GGC CTC GCG AAG CTA 
Gly Leu Ala Lys Leu 
615 

GCC GCG CGT GTC CGC 
Ala Ala Arg Val Arg 
635 



GTC ACT AAG 1943 

Val Thr Lys 
525 

CCT TTC GGC 19 91 

Pro Phe Gly 

540 

CTC TGC CTC 2039 

Leu Cys Leu 

GCC ACT GAC 2 08 7 

Ala Thr Asp 

CTG TCC TCC 2135 
Leu Ser Ser 
590 

GTT GGC GAG 218 3 

Val Gly Glu 

605 

CCC GGA CTG 2231 

Pro Gly Leu 

620 

GCG CGC CGA 227 9 

Ala Arg Arg 



35 



GCG CGC CGC CGC GCC GCT CGT GCC AAT TAGTTTGCTC GCTCCTGTTT 
Ala Arg Arg Arg Ala Ala Arg Ala Asn 
640 645 



2326 
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CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CATTACGCGT ACC C T AAAGA CTCTGGTGAG 23 8 6 



TCCCCGTCGT TACACGACGG GTCTGCCGCG GTTCGATTCC ATTCCCAAGC GGCAAGAAGG 2446 



AC GT AGT TAG CTCTGCGTCC CTCGGGATAC CA 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 647 amino acids 

(B) TYPE: amino acid 
£D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



Met Gly Asp Ala Gly Val Ala Ser Gin Arg Pro His Asn Arg Arg Gly 
15 10 15 

Thr Arg Asn Val Arg Val Ser Ala Asn Thr Val Thr Val Asn Gly Arg 
20 25 30 

Arg Asn Gin Arg Arg Arg Thr Gly Arg Gin Val Ser Pro Pro Asp Asn 
35 40 45 



30 Phe Thr Ala Ala Ala Gin Asp Leu Ala Gin Ser Leu Asp Ala Asn Thr 

50 55 60 

Val Thr Phe Pro Ala Asn lie Ser Ser Met Pro Glu Phe Arg Asn Trp 

65 70 75 80 



Ala Lys Gly Lys lie Asp Leu Asp Ser Asp Ser lie Gly Trp Tyr Phe 



212 



85 90 95 

Lys Tyr Leu Asp Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala Val Gly 
100 105 110 

Glu Tyr Ser Lys lie Pro Asp Gly Leu Val Lys Phe Ser Val Asp Ala 
115 120 125 

Glu lie Arg Glu lie Tyr Asn Glu Glu Cys Pro Val Val Thr Asp Val 
130 135 140 

Ser Val Pro Leu Asp Gly Arg Gin Trp Ser Leu Ser lie Phe Ser Phe 
145 150 155 160 

Pro Met Phe Arg Thr Ala Tyr Val Ala Val Ala Asn Val Glu Asn Lys 
165 170 175 

Glu Met Ser Leu Asp Val Val Asn Asp Leu He Glu Trp Leu Asn Asn 
180 185 190 

Leu Ala Asp Trp Arg Tyr Val Val Asp Ser Glu Gin Trp He Asn Phe 

195 200 205 

Thr Asn Asp Thr Thr Tyr Tyr Val Arg He Arg Val Leu Arg Pro Thr 
210 215 220 

Tyr Asp Val Pro Asp Pro Thr Glu Gly Leu Val Arg Thr Val Ser Asp 
225 230 235 240 

Tyr Arg Leu Thr Tyr Lys Ala He Thr Cys Glu Ala Asn Met Pro Thr 
245 250 255 

Leu Val Asp Gin Gly Phe Trp He Gly Gly Gin Tyr Ala Leu Thr Pro 
260 265 270 

Thr Ser Leu Pro Gin Tyr Asp Val Ser Glu Ala Tyr Ala Leu His Thr 



213 



Leu Thr Phe Ala Arg Pro Ser Ser Ala Ala Ala Leu Ala Phe Val Trp 
290 295 300 

Ala Gly Leu Pro Gin Gly Gly Thr Ala Pro Ala Gly Thr Pro Ala Trp 
305 310 315 320 

Glu Gin Ala Ser Ser Gly Gly Tyr Leu Thr Trp Arg His Asn Gly Thr 
325 330 335 

Thr Phe Pro Ala Gly Ser Val Ser Tyr Val Leu Pro Glu Gly Phe Ala 
340 345 350 

Leu Glu Arg Tyr Asp Pro Asn Asp Gly Ser Trp Thr Asp Phe Ala Ser 
355 360 365 

Ala Gly Asp Thr Val Thr Phe Arg Gin Val Ala Val Asp Glu Val Val 
370 375 380 

Val Thr Asn Asn Pro Ala Gly Gly Gly Ser Ala Pro Thr Phe Thr Val 
385 390 395 400 

Arg Val Pro Pro Ser Asn Ala Tyr Thr Asn Thr Val Phe Arg Asn Thr 
405 410 415 

Leu Leu Glu Thr Arg Pro Ser Ser Arg Arg Leu Glu Leu Pro Met Pro 
420 425 430 

Pro Ala Asp Phe Gly Gin Thr Val Ala Asn Asn Pro Lys He Glu Gin 
435 440 445 

Ser Leu Leu Lys Glu Thr Leu Gly Cys Tyr Leu Val His Ser Lys Met 
450 455 460 

Arg Asn Pro Val Phe Gin Leu Thr Pro Ala Ser Ser Phe Gly Ala Val 
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Ser Phe Asn Asn Pro Gly Tyr Glu Arg Thr Arg Asp Leu Pro Asp Tyr 
435 490 495 

Thr Gly lie Arg Asp Ser Phe Asp Gin Asn Met Ser Thr Ala Val Ala 
500 505 510 

His Phe Arg Ser Leu Ser His Ser Cys Ser lie Val Thr Lys Thr Tyr 
515 520 525 

Gin Gly Trp Glu Gly Val Thr Asn Val Asn Thr Pro Phe Gly Gin Phe 
530 535 540 

Ala His Ala Gly Leu Leu Lys Asn Glu Glu lie Leu Cys Leu Ala Asp 
545 550 555 560 

Asp Leu Ala Thr Arg Leu Thr Gly Val Tyr Pro Ala Thr Asp Asn Phe 

565 570 575 

Ala Ala Ala Val Ser Ala Phe Ala Ala Asn Met Leu Ser Ser Val Leu 
580 585 590 

Lys Ser Glu Ala Thr Ser Ser lie lie Lys Ser Val Gly Glu Thr Ala 
595 600 605 

Val Gly Ala Ala Gin Ser Gly Leu Ala Lys Leu Pro Gly Leu Leu Met 
610 615 620 

Ser Val Pro Gly Lys He Ala Ala Arg Val Arg Ala Arg Arg Ala Arg 
625 630 635 640 



Arg Arg Ala Ala Arg Ala Asn 
645 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 247 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 
15 (B) LOCATION: 283.. 2307 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

20 GTTTTTCTTT CTTTACCAAG TGTGGTAAAA T T T AAAC AAA GAAGAAAACC AGGACCGTAA 60 

CCCGGCCCTT ACACACCTCG AGTCCGTGAC CACCGGATTA TACGTCGCCC ACCACACGGC 120 



GCCTTTTCCG ACCACTCTCG AGAGTCGTTG GGAGTTTCGT CCGTGACCAC CCGGTTGGCA 180 



GTCGACAGAC GCTTCCGGAC CAC TAGAAC C TCCTCGAGCG ACGCACACAC AGCACACACA 240 



CCGCCTTAGC TGCACCTACG GCAGCGTTGA TAGCGCGGAT TT ATG AGC GAG CAC 2 94 

Met Ser Glu His 

30 i 

ACC ATC GCC CAC TCC ATC ACA TTA CCA CCC GGT TAC ACC CTT GCC CTA 342 
Thr lie Ala His Ser lie Thr Leu Pro Pro Gly Tyr Thr Leu Ala Leu 
5 10 15 20 



ATA CCC CCT GAA CCT GAA GCA GGA TGG GAG ATG CTG GAG TGG CGT CAC 3 90 
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lie Pro Pro Glu Pro Glu Ala Gly Trp Glu Met Leu Glu Trp Arg His 
25 30 35 

AGC GAC CTC ACA ACC GTC GCG GAA CCC GTA ACG TTC GGG TCA GCG CCA 438 
5 Ser Asp Leu Thr Thr Val Ala Glu Pro Val Thr Phe Gly Ser Ala Pro 

40 45 50 

ACA CCG TCA CCG TCA ATG GTA GAA GAA ACC AAC GGC GTC GGA CCG GAA 4 86 

Thr Pro Ser Pro Ser Met Val Glu Glu Thr Asn Gly Val Gly Pro Glu 
10 55 60 65 

GGC AAG TTT CTC CCC CTG ACA ATT TCA CCG CTG CTG CAC AAG ACC TCG 5 34 

Gly Lys Phe Leu Pro Leu Thr lie Ser Pro Leu Leu His Lys Thr Ser 
70 75 80 



15 



20 



CGC AAA GCC TTG ACG CCA ACA CCG TCA CTT TCC CCC GCT AAC ATC TCT 582 
Arg Lys Ala Leu Thr Pro Thr Pro Ser Leu Ser Pro Ala Asn lie Ser 
85 SO 95 100 

AGC ATG CCC GAA TTC CGG AAT TGG GCC AAG GGA AAG ATC GAC CTC GAC 630 
Ser Met Pro Glu Phe Arg Asn Trp Ala Lys Gly Lys lie Asp Leu Asp 
105 110 115 



TCC GAT TCC ATC GGC TGG TAC TTC AAG TAC CTT GAC CCA GCG GGT GCT 67 8 

25 Ser Asp Ser He Gly Trp Tyr Phe Lys Tyr Leu Asp Pro Ala Gly Ala 

120 125 130 

ACA GAG TCT GCG CGC GCC GTC GGC GAG TAC TCG AAG ATC CCT GAC GGC 726 
Thr Glu Ser Ala Arg Ala Val Gly Glu Tyr Ser Lys He Pro Asp Gly 



30 



35 



135 140 145 

CTC GTC AAG TTC TCC GTC GAC GCA GAG ATA AGA GAG ATC TAT AAC GAG 

Leu Val Lys Phe Ser Val Asp Ala Glu He Arg Glu He Tyr Asn Glu 
150 155 160 

GAG TGC CCC GTC GTC ACT GAC GTG TCC GTC CCC CTC GAC GGC CGC CAG 
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Glu Cys Pro Val Val Thr Asp Val Ser Val Pro Leu Asp Gly Arg Gin 
165 170 175 180 

TGG AGC CTC TCG ATT TTC TCC TTT CCG ATG TTC AGA ACC GCC TAC GTC 87 0 

5 Trp Ser Leu Ser lie Phe Ser Phe Pro Met Phe Arg Thr Ala Tyr Val 

185 190 195 

GCC GTA GCG AAC GTC GAG AAC AAG GAG ATG TCG CTC GAC GTT GTC AAC 918 
Ala Val Ala Asn Val Glu Asn Lys Glu Met Ser Leu Asp Val Val Asn 
10 200 205 210 

GAC CTC ATC GAG TGG CTC AAC AAT CTC GCC GAC TGG CGT TAT GTC GTT 966 
Asp Leu lie Glu Trp Leu Asn Asn Leu Ala Asp Trp Arg Tyr Val Val 
215 220 225 

15 

GAC TCT GAA CAG TGG ATT AAC TTC ACC AAT GAC ACC ACG TAC TAC GTC 1014 
Asp Ser Glu Gin Trp lie Asn Phe Thr Asn Asp Thr Thr Tyr Tyr Val 
230 235 240 



20 



CGC ATC CGC GTT CTA CGT CCA ACC TAC GAC GTT CCA GAC CCC ACA GAG 
Arg lie Arg Val Leu Arg Pro Thr Tyr Asp Val Pro Asp Pro Thr Glu 
245 250 255 260 
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10 



15 



20 



25 



30 



35 



GGC CTT GTT CGC ACA GTC TCA GAC TAC CGC CTC ACT TAT AAG GCG ATA 1110 

Gly Leu Val Arg Thr Val Ser Asp Tyr Arg Leu Thr Tyr Lys Ala lie 

265 270 275 

ACA TGT GAA GCC AAC ATG CCA ACA CTC GTC GAC CAA GGC TTT TGG ATC 115 8 

Thr Cys Glu Ala Asn Met Pro Thr Leu Val Asp Gin Gly Phe Trp lie 

280 285 290 

GGC GGC CAG TAC GCT CTC ACC CCG ACT AGC CTA CCG CAG TAC GAC GTC 1206 

Gly Gly Gin Tyr Ala Leu Thr Pro Thr Ser Leu Pro Gin Tyr Asp Val 

295 300 305 



AGC GAG GCC TAC GCT CTG CAC ACT TTG ACC TTC GCC AGA CCA TCC AGC 12 54 

/ 

Ser Glu Ala Tyr Ala Leu His Thr Leu Thr Phe Ala Arg Pro Ser Ser 
310 315 320 



GCC GCT GCA CTC GCG TTT GTG TGG GCA GGT TTG CCA CAG GGT GGC ACT 1302 

Ala Ala Ala Leu Ala Phe Val Trp Ala Gly Leu Pro Gin Gly Gly Thr 

325 330 335 340 

GCG CCT GCA GGC ACT CCA GCC TGG GAG CAG GCA TCC TCG GGT GGC TAC 13 50 

Ala Pro Ala Gly Thr Pro Ala Trp Glu Gin Ala Ser Ser Gly Gly Tyr 
345 350 355 

CTC ACC TGG CGC CAC AAC GGT ACT ACT TTC CCA GCT GGC TCC GTT AGC 13 98 

Leu Thr Trp Arg His Asn Gly Thr Thr Phe Pro Ala Gly Ser Val Ser 
360 365 370 

TAC GTT CTC CCT GAG GGT TTC GCC CTT GAG CGC TAC GAC CCG AAC GAC 14 4 6 

Tyr Val Leu Pro Glu Gly Phe Ala Leu Glu Arg Tyr Asp Pro Asn Asp 

375 380 385 

GGC TCT TGG ACC GAC TTC GCT TCC GCA GGA GAC ACC GTC ACT TTC CGG 14 94 

Gly Ser Trp Thr Asp Phe Ala Ser Ala Gly Asp Thr Val Thr Phe Arg 
390 395 400 
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CAG GTC GCC GTC GAC GAG GTC GTT GTG ACC AAC AAC CCC GCC GGC GGC 
Gin Val Ala Val Asp Glu Val Val Val Thr Asn Asn Pro Ala Gly Gly 
405 410 415 420 

5 GGC AGC GCC CCC ACC TTC ACC GTG AGA GTG CCC CCT TCA AAC GCT TAC 

Gly Ser Ala Pro Thr Phe Thr Val Arg Val Pro Pro Ser Asn Ala Tyr 
425 430 435 

ACC AAC ACC GTG TTT AGG AAC ACG CTC TTA GAG ACT CGA CCC TCC TCT 
10 Thr Asn Thr Val Phe Arg Asn Thr Leu Leu Glu Thr Arg Pro Ser Ser 

440 445 450 

CGT AGG CTC GAA CTC CCT ATG CCA CCT GCT GAC TTT GGA CAG ACG GTC 
Arg Arg Leu Glu Leu Pro Met Pro Pro Ala Asp Phe Gly Gin Thr Val 
15 455 460 465 

GCC AAC AAC CCG AAG ATC GAG CAG TCG CTT CTT AAA GAA ACA CTT GGC 

Ala Asn Asn Pro Lys He Glu Gin Ser Leu Leu Lys Glu Thr Leu Gly 

470 475 480 

20 

TGC TAT TTG GTC CAC TCC AAA ATG CGA AAC CCC GTT TTC CAG CTC ACG 

Cys Tyr Leu Val His Ser Lys Met Arg Asn Pro Val Phe Gin Leu Thr 

485 490 495 500 

25 CCA GCC AGC TCC TTT GGC GCC GTT TCC TTC AAC AAT CCG GGT TAT GAG 

Pro Ala Ser Ser Phe Gly Ala Val Ser Phe Asn Asn Pro Gly Tyr Glu 
505 510 515 

CGC ACA CGC GAC CTC CCG GAC TAC ACT GGC ATC CGT GAC TCA TTC GAC 
30 Arg Thr Arg Asp Leu Pro Asp Tyr Thr Gly He Arg Asp Ser Phe Asp 

520 525 530 

CAG AAC ATG TCC ACC GCT GTG GCC CAC TTC CGC TCA CTC TCC CAC TCC 
Gin Asn Met Ser Thr Ala Val Ala His Phe Arg Ser Leu Ser His Ser 
35 535 540 545 
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TGC AGT ATC GTC ACT AAG ACC TAC CAG GGT TGG GAA GGC GTC ACG AAC 197 4 

Cys Ser He Val Thr Lys Thr Tyr Gin Gly Trp Glu Gly Val Thr Asn 
550 555 560 

5 GTC AAC ACG CCT TTC GGC CAA TTC GCG CAC GCG GGC CTC CTC AAG AAT 2022 

Val Asn Thr Pro Phe Gly Gin Phe Ala His Ala Gly Leu Leu Lys Asn 
565 570 575 580 

GAG GAG ATC CTC TGC CTC GCC GAC GAC CTG GCC ACC CGT CTC ACA GGT 2 07 0 

10 Glu Glu He Leu Cys Leu Ala Asp Asp Leu Ala Thr Arg Leu Thr Gly 

565 590 595 

GTC TAC CCC GCC ACT GAC AAC TTC GCG GCC GCC GTT TCT GCC TTC GCC 2118 
Val Tyr Pro Ala Thr Asp Asn Phe Ala Ala Ala Val Ser Ala Phe Ala 
15 600 605 610 

GCG AAC ATG CTG TCC TCC GTG CTG AAG TCG GAG GCA ACG TCC TCC ATC 2166 

Ala Asn Met Leu Ser Ser Val Leu Lys Ser Glu Ala Thr Ser Ser He 
615 620 625 

20 

ATC AAG TCC GTT GGC GAG ACT GCC GTC GGC GCG GCT CAG TCC GGC CTC 2214 

He Lys Ser Val Gly Glu Thr Ala Val Gly Ala Ala Gin Ser Gly Leu 
630 635 640 

25 GCG AAG CTA CCC GGA CTG CTA ATG AGT GTA CCA GGG AAG ATT GCC GCG 2262 

Ala Lys Leu Pro Gly Leu Leu Met Ser Val Pro Gly Lys He Ala Ala 
645 650 655 660 

CGT GTC CGC GCG CGC CGA GCG CGC CGC CGC GCC GCT CGT GCC AAT 2307 
30 Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala Arg Ala Asn 

665 670 675 

TAGTTTGCTC GCTCCTGTTT CGCCGTTTCG TAAAACGGCG TGGTCCCGCA CATTACGCGT 2 367 



35 



ACCC TAAAGA CTC T GGT GAG TCCCCGTCGT TAC AC GAC G G GTCTGCCGCG GTTCGATTCC 2427 
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ATTCCCAAGC GGCAAGAAGG ACGTAGTTAG CTCTGCGTCC CTCGGGATAC CA 



(2) INFORMATION FOR SEQ ID NO: 52: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



Met Ser Glu His 
1 

Thr Leu Ala Leu 
20 

Glu Trp Arg His 
35 

Gly Ser Ala Pro 
50 

Val Gly Pro Glu 
65 

His Lys Thr Ser 

Ala Asn lie Ser 
100 



Thr lie Ala His 
5 

lie Pro Pro Glu 

Ser Asp Leu Thr 
40 

Thr Pro Ser Pro 

55 

Gly Lys Phe Leu 
70 

Arg Lys Ala Leu 
85 

Ser Met Pro Glu 



Ser lie Thr Leu 
10 

Pro Glu Ala Gly 
25 

Thr Val Ala Glu 

Ser Met Val Glu 

60 

Pro Leu Thr lie 
75 

Thr Pro Thr Pro 
90 

Phe Arg Asn Trp 
105 



Pro Pro Gly Tyr 
15 

Trp Glu Met Leu 
30 

Pro Val Thr Phe 
45 

Glu Thr Asn Gly 

Ser Pro Leu Leu 
80 

Ser Leu Ser Pro 

95 

Ala Lys Gly Lys 
110 
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lie Asp Leu Asp Ser Asp Ser He Gly Trp Tyr Phe Lys Tyr Leu Asp 
115 120 125 

Pro Ala Gly Ala Thr Glu Ser Ala Arg Ala Val Gly Glu Tyr Ser Lys 
130 135 140 

He Pro Asp Gly Leu Val Lys Phe Ser Val Asp Ala Glu He Arg Glu 
145 150 155 160 

He Tyr Asn Glu Glu Cys Pro Val Val Thr Asp Val Ser Val Pro Leu 
165 170 175 

Asp Gly Arg Gin Trp Ser Leu Ser He Phe Ser Phe Pro Met Phe Arg 
180 185 190 

Thr Ala Tyr Val Ala Val Ala Asn Val Glu Asn Lys Glu Met Ser Leu 

195 200 205 

Asp Val Val Asn Asp Leu He Glu Trp Leu Asn Asn Leu Ala Asp Trp 
210 215 220 

Arg Tyr Val Val Asp Ser Glu Gin Trp He Asn Phe Thr Asn Asp Thr 
225 230 235 240 

Thr Tyr Tyr Val Arg He Arg Val Leu Arg Pro Thr Tyr Asp Val Pro 
245 250 255 

Asp Pro Thr Glu Gly Leu Val Arg Thr Val Ser Asp Tyr Arg Leu Thr 
260 265 270 

Tyr Lys Ala He Thr Cys Glu Ala Asn Met Pro Thr Leu Val Asp Gin 
275 280 285 

Gly Phe Trp He Gly Gly Gin Tyr Ala Leu Thr Pro Thr Ser Leu Pro 
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Gin Tyr Asp Val Ser Glu Ala Tyr Ala Leu His Thr Leu Thr Phe Ala 
305 310 315 320 

Arg Pro Ser Ser Ala Ala Ala Leu Ala Phe Val Trp Ala Gly Leu Pro 
325 330 335 

Gin Gly Gly Thr Ala Pro Ala Gly Thr Pro Ala Trp Glu Gin Ala Ser 
340 345 350 

Ser Gly Gly Tyr Leu Thr Trp Arg His Asn Gly Thr Thr Phe Pro Ala 
355 360 365 

Gly Ser Val Ser Tyr Val Leu Pro Glu Gly Phe Ala Leu Glu Arg Tyr 
370 375 380 

Asp Pro Asn Asp Gly Ser Trp Thr Asp Phe Ala Ser Ala Gly Asp Thr 
335 390 395 400 

Val Thr Phe Arg Gin Val Ala Val Asp Glu Val Val Val Thr Asn Asn 
405 410 415 

Pro Ala Gly Gly Gly Ser Ala Pro Thr Phe Thr Val Arg Val Pro Pro 
420 425 430 

Ser Asn Ala Tyr Thr Asn Thr Val Phe Arg Asn Thr Leu Leu Glu Thr 
435 440 445 

Arg Pro Ser Ser Arg Arg Leu Glu Leu Pro Met Pro Pro Ala Asp Phe 
450 455 460 

Gly Gin Thr Val Ala Asn Asn Pro Lys lie Glu Gin Ser Leu Leu Lys 
465 470 475 480 



Glu Thr Leu Gly Cys Tyr Leu Val His Ser Lys Met Arg Asn Pro Val 
485 490 495 
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Phe Gin Leu Thr Pro Ala Ser Ser Phe Gly Ala Val Ser Phe Asn Asn 
500 505 510 

Pro Gly Tyr Glu Arg Thr Arg Asp Leu Pro Asp Tyr Thr Gly lie Arg 
515 520 525 

Asp Ser Phe Asp Gin Asn Met Ser Thr Ala Val Ala His Phe Arg Ser 
530 535 540 

Leu Ser His Ser Cys Ser lie Val Thr Lys Thr Tyr Gin Gly Trp Glu 
545 550 555 560 

Gly Val Thr Asn Val Asn Thr Pro Phe Gly Gin Phe Ala His Ala Gly 
565 570 575 

Leu Leu Lys Asn Glu Glu lie Leu Cys Leu Ala Asp Asp Leu Ala Thr 
580 535 590 

Arg Leu Thr Gly Val Tyr Pro Ala Thr Asp Asn Phe Ala Ala Ala Val 

595 600 605 

Ser Ala Phe Ala Ala Asn Met Leu Ser Ser Val Leu Lys Ser Glu Ala 
610 615 620 

Thr Ser Ser He He Lys Ser Val Gly Glu Thr Ala Val Gly Ala Ala 
625 630 635 640 

Gin Ser Gly Leu Ala Lys Leu Pro Gly Leu Leu Met Ser Val Pro Gly 
645 650 655 

Lys lie Ala Ala Arg Val Arg Ala Arg Arg Ala Arg Arg Arg Ala Ala 
660 665 670 

Arg Ala Asn 
675 
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(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



GGGGATCCAC AGTTCTGCCT CCCCCGGACG GTAAATATAG GGGAACCATG GTCTAGAGG 



CLAIMS 



1. A capsovector for use in controlling insect pests, the cap so vector comprising a 
capsid protein of an insect small RNA virus encapsidating an insecticidal protein toxin, 

5 the capsid protein protecting the protein toxin from inactivation in the gut of an insect 
following ingestion of the capsovector by the insect. 

2. A capsovector as claimed in claim 1 in which the insect small RNA virus is 
HaSV. 

10 

3. A capsovector as claimed in claim 1 in which the capsid protein is P71 (SEQ 
ID No. 50) 

4. A capsovector as claimed in claim 1 in which the insecticidal toxin is of plant 
15 origin. 

5. A capsovector as claimed in claim 1 in which the insecticidal toxin is Ricin A 
or diptheria toxin. 

20 6. A capsovector for use in controlling insect pests, the capsovector comprising a 
capsid protein of an insect small RNA virus encapsidating a nucleic acid molecule 
which is insecticidal or which encodes an insecticidal protein toxin, the capsid protein 
protecting the protein toxin from inactivation in the gut of an insect following 
ingestion of the capsovector by the insect. 

25 

7. A capsovector as claimed in claim 6 in which the nucleic acid is RNA. 
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8. A capsovector as claimed in claim 7 in which the insect small RNA virus is 
HaSV. 
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9. A capsovector as claimed in claim 7 in which the capsid protein is P71 (SEQ 
ID No. 50) 

10. A capsovector as claimed in claim 7 in which the insecticidal toxin is of plant 
5 origin. 

11. A capsovector as claimed in claim 7 in which the insecticidal toxin is Ricin A 
or diptheria toxin. 

10 12. A capsovector as claimed in claim 7 in which the RNA is an antisense 
sequence, a ribozyme or a mimicking structure. 

13. A capsovector as claimed in claim 12 in which the mimicking structure is RNA 
hybridised so as to at least partially form double stranded RNA. 

15 

14. A capsovector as claimed in claim 6 in which the capsovector includes a further 
nucleic acid sequence, the further nucleic acid sequence encoding the capsid protein. 

15. An isolated nucleic acid molecule comprising a first sequence encoding at least 
20 one capsid protein of an insect small RNA virus and a second sequence which is 

insecticidal or which encodes an insecticidal protein toxin. 

16. An isolated nucleic acid molecule as claimed in claim 15 in which the nucleic 
acid is RNA. 

25 

17. An isolated nucleic acid molecule as claimed in claim 15 in which the insect 
small RNA virus is HaSV. 

18. An isolated nucleic acid molecule as claimed in claim 15 in which the capsid 
30 protein is P71 (SEQ ID No. 50) - 
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19. An isolated nucleic acid molecule as claimed in claim 15 in which the 
insecticidal toxin is of plant origin. 

20. An isolated nucleic acid molecule as claimed in claim 15 in which the 
5 insecticidal toxin is Ricin A. 

21. An isolated nucleic acid molecule as claimed in claim 15 in which the second 
sequence is an antisense sequence, a ribozyme or a mimicking structure. 

10 22. An isolated nucleic acid molecule as claimed in claim 21 in which the 
mimicking structure is double stranded RNA. 

23. An isolated nucleic acid molecule as claimed in claim 15 in which the 
insecticidal toxin is less toxic to plants than insects. 

15 

24. A transgenic plant resistant to insect attack comprising a genome or subgenome 
capable of expressing the nucleic acid molecule as claimed in claim 1 5 such that the 
transgenic plant produces capsid protein in which is encapsidated the nucleic acid 
molecule. 

20 

25. An isolated nucleic acid molecule comprising a first sequence encoding at least 
one capsid protein of an insect small RNA virus, a second sequence which is 
insecticidal or which encodes an insecticidal protein toxin and a third sequence 
positioned between the first and second sequence, the third sequence directing 

25 expression of the second sequence in an insect pest. 

26. An isolated nucleic acid molecule as claimed in claim 25 in which the third 
sequence is an IRES. 
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27. An isolated nucleic acid molecule as claimed in claim 25 in which the nucleic 
acid is RNA. 

28. An isolated nucleic acid molecule as claimed in claim 25 in which the insect 
5 small RNA virus is HaSV. 

29. An isolated nucleic acid molecule as claimed in claim 25 in which the capsid 
protein is P71 (SEQ ID No. 50) 

10 30. An isolated nucleic acid molecule as claimed in claim 25 in which the 
insecticidal toxin is of plant origin. 

31. An isolated nucleic acid molecule as claimed in claim 25 in which the 
insecticidal toxin is Ricin A or diptheria toxin. 

15 

32. A transgenic plant resistant to insect attack comprising a genome or subgenome 
capable of expressing the nucleic acid molecule as claimed in claim 25 such that the 
transgenic plant produces capsid protein in which is encapsidated the nucleic acid 
molecule. 



20 
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Abstract 

The present invention relates to an isolated small RNA virus capable of infecting 
insect species including Heliothis species, and to the nucleotide sequences and proteins 
encoded thereby. The invention contemplates uses of the virus in controlling insect 
attack in plants. 
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Map of HaSV RNA 1 clones 
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Map of HaSV RNA 2 clones 
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DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 



As a below-named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe ! am the original, first and sole inventor (if only one name is listed below) or an original, first 
and Joint inventor (if plural names are listed below) of the subject matter which is claimed and for which 
a patent is sought on the invention entitled INSECT VIRUSES AND THEIR USES IN PROTECTING PLANTS 



the specification of which 

is attached hereto. 

was filed on June 7. 1995 as 
Application Serial No. 08/485 t 355 

and was amended on . 

(if applicable) 

I hereby state that I have reviewed and understand the contents of the above- identified specification, 
including the claims, as amended by any amendment referred to above. / 

I acknowledge the duty to disclose to the Patent Office all information known to me to be material to 
patentability as defined in 37 C.F.R. 1.56. 

I hereby claim foreign priority benefits under Title 35, United states Code, §1 19 of any foreign appt icat ion(s) 
for patent or inventor's certificate listed below and have also identified below any foreign application 
for patent or inventor's certificate having a filing date before that of the application on which priority 
is claimed: 



Prior Foreign Application(s) Priority Claimed 



PL4081/92 


Australia 


14 August 1992 


0 
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(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No . 
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(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 
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(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 



I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) 
listed below and, insofar as the subject matter of each of the claims of this application is not disclosed 
in the prior United States application in the manner provided by the first paragraph of Title 35, United 
States Code, §112, I acknowledge the duty to disclose to the Patent Office all information known to me 
to be material to patentability as defined in 37 C.F.R. 1.56 which occurred between the filing date of 
the prior application and the national or PCT international filing date of this application: 

08/440,522 12 May 1995 Pending 

(Application Serial No.) (Filing Date) (Status) 

(patented, pending, abandoned) 



(Application Serial No.) (Filing Date) (Status) 

(patented, pending, abandoned) 
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one) 



□ 
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I hereby appoint the following attorneys to prosecute this application and to transact all business in the 
Patent and Trademark Office connected therewith: Harold C. Hohbach, Reg. No. 17,757; Aldo J. Test, Reg. 
No. 18,048; Thomas O. Herbert, Reg. No. 18,612; Donald N. Macintosh, Reg. No. 20,316; Jerry G. Wright, Reg. 
No. 20,165; Edward S. Wright, Reg. No. 24,903; David J. Brezner, Reg. No. 24,774; Richard E. Backus, Reg. 
No. 22,701; James A. Sheridan, Reg. No. 25,435; Robert B. Checkering, Reg. No. 24,286; Gary S. Williams, 
Reg. No. 31,066; Richard F. Trecartin, Reg. No. 31,801; C. Michael Zimmerman, Reg. No. 20,451; Walter H. 
Dreger, Reg. No. 24,190; Steven F. Caserza, Reg. No. 29,780; 



provided that if any one of said attorneys ceases being affiliated with the law firm of Flehr, Hohbach, Test, 
Albritton& Herbert as partner, employee or of counsel, such attorney's appointment as attorney and all powers 
derived therefrom shall terminate on the date such attorney ceases being so affiliated. 

Direct all telephone calls to Walter H. Dreger at (415 ) 781-1989. 

Address all correspondence to: 

FLEHR, HOHBACH, TEST, 
AL BR IT TON & HERBERT 
Svite 3400, Four Embarcadero Center 
San Francisco, California 94111 

File No. A-58631-2/WHP/LEA 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made 
on information and belief are believed to be true; and further that these statements were made with the 
knowledge that willful false statements and the like so made are punishable by fine or Imprisonment, or both, 
under Title 18, United States Code, §1001 and that such willful false statements may jeopardize the validity 
of the application or any patent issued thereon. 

Full name of sole or 
first inventor: 



Inventor's signature: 
Date: 



Residence: 



Citizenship: 

Post Office Address: 



Full name of second joint 
inventor, if any: 

Inventor's signature: 

Date: 



Residence: 



Citizenship: 

Post Office Address: 



Peter Daniel Christian 



tyaetemu Australian Capital Territory, 2601 Australia 



Great Britain 



rynph fl gi . Australian Capital Territory, 2601 Australia 



Karl Hiyenrich Julius Gordon 



J/ <~A (iWv. 



Meston, Australian Capital Territory, 2611 Australia 



Austral i a/Germany 



18 Chevalier Street " 



Weston, Australian Capital Territory, 2611 Australia 



Form No. 1.01 



DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 
Page 2 



02/93 



Full name of third joint 



inventor, if any: 
Inventor's signature: 
Date: 




Residence: Chapman, Australian Capital Territory, 2611 Australia 

Citizenship: United States 

Post Office Address: 3 Garner Place 



Chapman. Australian Capital Territory, 2611 Australia 
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